Abstract

We consider existential rules (aka Datalog±) as a formalism for specifying
ontologies. In recent years, many classes of existential rules have been
exhibited for which conjunctive query (CQ) entailment is decidable. However,
most of these classes cannot express transitivity of binary relations, a
frequently used modelling construct. In this paper, we address the issue of
whether transitivity can be safely combined with decidable classes of
existential rules. First, we prove that transitivity is incompatible with one
of the simplest decidable classes, namely aGRD (acyclic graph of rule
dependencies), which clarifies the landscape of ‘finite expansion sets’ of
rules. Second, we show that transitivity can be safely added to linear rules
(a subclass of guarded rules, which generalizes the description logic
DL-Lite$_\mathcal{R}$) in the case of atomic CQs, and also for general CQs if
we place a minor syntactic restriction on the rule set. This is shown by means
of a novel query rewriting algorithm that is specially tailored to handle
transitivity rules. Third, for the identified decidable cases, we analyze the
combined and data complexities of query entailment.


1 Introduction 

Ontology-based data access (OBDA) is a new paradigm in data management, which
exploits the semantic information provided by ontologies when querying data.
Briefly, the notion of a database is replaced by that of a knowledge base
(KB), composed of a dataset and an ontology. Existential rules, aka Datalog±,
have been proposed to represent ontological knowledge in this context
[Calì et al., 2009; Baget et al., 2009; Baget et al., 2011b; Krötzsch and
Rudolph, 2011]. These rules are an extension of function-free first-order Horn
rules (aka Datalog) that allows for existentially quantified variables in rule
heads.

¹ This document is an extended and revised version of the IJCAI'15 paper with
the same title. It contains an appendix with proofs omitted from the
conference version. The most recent revision (December 2016) adds a new result
about msa (Proposition 2) and corrects a mistake in the stated complexity
bounds.


The addition of existential quantification allows one to assert the existence
of yet unknown entities and to reason about them, an essential feature of
ontological languages, which is also at the core of description logics (DLs).
Existential rules generalize the DLs most often considered in the OBDA
setting, like the DL-Lite and $\mathcal{EL}$ families [Calvanese et al., 2007;
Baader, 2003; Lutz et al., 2009] and Horn DLs [Krötzsch et al., 2007].


The fundamental decision problem related to OBDA is the following: is a
Boolean conjunctive query (CQ) entailed from a KB? This problem has long been
known to be undecidable for general existential rules (this follows e.g., from
[Beeri and Vardi, 1981]). Consequently, a significant amount of research has
been devoted to the issue of finding decidable subclasses with a good
expressivity / tractability tradeoff. It has been observed that most exhibited
decidable classes fulfill one of the three following properties
[Baget et al., 2011a]: finiteness of a forward chaining mechanism known as the
chase, which allows inferences to be materialized in the data (we call such
rule sets finite expansion sets, fes); finiteness of query rewriting into a
union of CQs, which allows the rules to be compiled into the query (finite
unification sets, fus); tree-like shape of the possibly infinite chase, which
allows one to finitely encode the result (bounded-treewidth sets, bts). The
class of guarded rules [Calì et al., 2008] is a well-known class satisfying
the latter property.


Known decidable classes are able to express many useful properties of binary
relations (e.g., inverses / symmetry) but most of them lack the ability to
define a frequently required property, namely transitivity. This limits their
applicability in key application areas like biology and medicine, for which
transitivity of binary relations (especially the ubiquitous ‘part of’
relation) is an essential modelling construct. The importance of transitivity
has long been acknowledged in the DL community [Horrocks and Sattler, 1999;
Sattler, 2000], and many DLs support transitive binary relations. While adding
transitivity to a DL often does not increase the complexity of CQ entailment
(see [Eiter et al., 2009] for some exceptions), it is known to complicate the
design of query answering procedures [Glimm et al., 2008; Eiter et al., 2012],
due to the fact that it destroys the tree structure of the chase upon which DL
reasoning algorithms typically rely. In contrast to the extensive literature
on transitivity in DLs, rather little is known about the compatibility of
transitivity with decidable classes of existential rules.² A notable exception
is the result of [Gottlob et al., 2013] on the incompatibility of transitivity
with guarded rules, which holds even under strong syntactic restrictions (see
Section 3).

In this paper, we investigate the issue of whether transitivity can be safely
added to some well-known rule classes and provide three main contributions.
First, we show that adding transitivity to one of the simplest fes and fus
classes (namely aGRD) makes atomic CQ entailment undecidable (Theorem 1). We
also provide (un)decidability results for the classes swa and msa extended
with transitivity, which yields a complete picture of the impact of
transitivity on known fes classes. Second, we investigate the impact of adding
transitivity to linear rules, a natural subclass of guarded rules which
generalizes the well-known description logic DL-Lite$_\mathcal{R}$. We
introduce a query rewriting procedure that is sound and complete for all rule
sets consisting of linear and transitivity rules (Theorem 2), and which is
guaranteed to terminate for atomic CQs, and for arbitrary CQs if the rule set
contains only unary and binary predicates or satisfies a certain safety
condition, yielding decidability for these cases (Theorem 3). Third, based on
a careful analysis of our algorithm, we establish upper and lower bounds on
the combined and data complexities of query entailment for the identified
decidable cases (Theorems 4 and 5). While the addition of transitivity leads
to an increase in the combined complexity of atomic CQ entailment (which rises
from PSPACE-complete to ExpTime-complete), the obtained data complexity is the
lowest that could be expected, namely, NL-complete.

2 Preliminaries 

A term is a variable or a constant. An atom is of the form
$p(t_1, \ldots, t_k)$ where $p$ is a predicate of arity $k$, and the $t_i$ are
terms. We consider (unions of) Boolean conjunctive queries ((U)CQs), which are
(disjunctions of) existentially closed conjunctions of atoms. Note however
that all results can be extended to non-Boolean queries. A CQ is often viewed
as the set of its atoms. An atomic CQ is a CQ consisting of a single atom. A
fact is an atom without variables. A fact base is a finite set of facts.

An existential rule (hereafter abbreviated to rule) $R$ is a formula
$\forall\vec{x}\,\forall\vec{y}\,(B[\vec{x},\vec{y}] \rightarrow \exists\vec{z}\, H[\vec{x},\vec{z}])$
where $B$ and $H$ are conjunctions of atoms, resp. called the body and the
head of $R$. The variables $\vec{z}$ (resp. $\vec{x}$), which occur only in
$H$ (resp. in $B$ and in $H$), are called existential variables (resp.
frontier variables). Hereafter, we omit quantifiers in rules and simply denote
a rule by $B \rightarrow H$. For example, $p(x,y) \rightarrow p(x,z)$ stands
for $\forall x\,\forall y\,(p(x,y) \rightarrow \exists z\, p(x,z))$. A
knowledge base (KB) $\mathcal{K} = (\mathcal{F}, \mathcal{R})$ consists of a
fact base $\mathcal{F}$ and a finite set of rules $\mathcal{R}$. The (atomic)
CQ entailment problem consists in deciding whether $\mathcal{K} \models Q$,
where $\mathcal{K}$ is a KB viewed as a first-order theory, $Q$ is an (atomic)
CQ, and $\models$ denotes standard logical entailment.
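The definitions above can be made concrete with a small data-structure sketch. It is only an illustration: the `Atom` and `Rule` classes and the convention that a term starting with `?` is a variable are assumptions of this sketch, not notation from the paper.

```python
from dataclasses import dataclass


# Assumed convention for this sketch: a term starting with "?" is a
# variable, any other string is a constant.
def is_variable(term):
    return term.startswith("?")


@dataclass(frozen=True)
class Atom:
    predicate: str
    terms: tuple


def variables(atoms):
    """Set of variables occurring in a collection of atoms."""
    return {t for atom in atoms for t in atom.terms if is_variable(t)}


@dataclass(frozen=True)
class Rule:
    body: tuple  # conjunction of atoms B
    head: tuple  # conjunction of atoms H

    def frontier(self):
        # Variables shared by the body and the head.
        return variables(self.body) & variables(self.head)

    def existential(self):
        # Variables that occur only in the head.
        return variables(self.head) - variables(self.body)


# The rule p(x, y) -> p(x, z) used as an example in the text.
rule = Rule(body=(Atom("p", ("?x", "?y")),), head=(Atom("p", ("?x", "?z")),))
```

For this rule, `rule.frontier()` contains only `?x` and `rule.existential()` contains only `?z`, matching the reading of the example rule given above.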

Query rewriting relies on a unification operation between 
the query and a rule head. Care must be taken when handling 

² Since the conference version of this paper, the compatibility of
transitivity with frontier-one rules (a bts class that has close connections
to Horn DLs) has been shown [Amarilli et al., 2016].


existential variables: when a term $t$ of the query is unified with an
existential variable in a rule head, all atoms in which $t$ occurs must also
be part of the unification, otherwise the result is unsound. Thus, instead of
unifying one query atom at a time, we have to unify subsets (“pieces”) of the
query, hence the notion of a piece-unifier defined next. A partition $P$ of a
set of terms is said to be admissible if no class of $P$ contains two
constants; a substitution $\sigma$ can be obtained from $P$ by selecting an
element $e_i$ in each class $C_i$ of $P$, with priority given to constants,
and setting $\sigma(t) = e_i$ for all $t \in C_i$. A piece-unifier of a CQ $Q$
with a rule $R = B \rightarrow H$ is a triple $\mu = (Q', H', P_\mu)$, where
$Q' \subseteq Q$, $H' \subseteq H$ and $P_\mu$ is an admissible partition on
the terms of $Q' \cup H'$ such that:

1. $\sigma(H') = \sigma(Q')$, where $\sigma$ is any substitution obtained
from $P_\mu$;

2. if a class $C_i$ in $P_\mu$ contains an existential variable, then the
other terms in $C_i$ are variables from $Q'$ that do not occur in
$(Q \setminus Q')$.

We say that $Q'$ is a piece (and $\mu$ is a single-piece unifier) if there is
no non-empty proper subset $Q''$ of $Q'$ such that $P_\mu$ restricted to $Q''$
satisfies Condition 2. From now on, we consider only single-piece unifiers,
which we simply call unifiers. The (direct) rewriting of $Q$ with $R$ w.r.t.
$\mu$ is $\sigma(Q \setminus Q') \cup \sigma(B)$ where $\sigma$ is a
substitution obtained from $P_\mu$. A rewriting of $Q$ w.r.t. a set of rules
$\mathcal{R}$ is a CQ obtained by a sequence $Q = Q_0, \ldots, Q_n$
($n \geq 0$) where for all $i > 0$, $Q_i$ is a direct rewriting of $Q_{i-1}$
with a rule from $\mathcal{R}$. For any fact base $\mathcal{F}$, we have that
$\mathcal{F}, \mathcal{R} \models Q$ iff there is a rewriting $Q_n$ of $Q$
w.r.t. $\mathcal{R}$ such that $\mathcal{F} \models Q_n$ [König et al., 2013].

Example 1 Consider the rule $R = h(x) \rightarrow p(x,y)$ and CQ
$Q = q(u) \wedge p(u,v) \wedge p(w,v) \wedge r(w)$. If $p(u,v)$ is unified
with $p(x,y)$, then $v$ is unified with the existential variable $y$, hence
$p(w,v)$ has to be part of the unifier. The triple
$\mu = (\{p(u,v), p(w,v)\}, \{p(x,y)\}, \{\{x,u,w\},\{v,y\}\})$ is a unifier.
The direct rewriting of $Q$ associated with the substitution
$\sigma = \{x \mapsto u, w \mapsto u, y \mapsto v\}$ is
$h(u) \wedge q(u) \wedge r(u)$.
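The direct rewriting of Example 1 can be replayed with a few lines of Python. This is a minimal sketch under stated assumptions: atoms are encoded as tuples `(predicate, term, ...)`, the substitution is given explicitly rather than derived from a partition, and the function names are hypothetical. Only Condition 1 of the piece-unifier definition is checked; Condition 2 on existential variables is not.

```python
def substitute(sigma, atom):
    """Apply substitution sigma to the terms of an atom (predicate, t1, ..., tk)."""
    predicate, *terms = atom
    return (predicate, *(sigma.get(t, t) for t in terms))


def direct_rewriting(query, piece, head_part, body, sigma):
    """Return sigma(Q minus Q') united with sigma(B).

    Raises ValueError if sigma(H') differs from sigma(Q') (Condition 1).
    """
    unified_head = {substitute(sigma, a) for a in head_part}
    unified_piece = {substitute(sigma, a) for a in piece}
    if unified_head != unified_piece:
        raise ValueError("sigma does not unify the piece with the head atoms")
    rest = query - piece
    return {substitute(sigma, a) for a in rest} | {substitute(sigma, a) for a in body}


# Data of Example 1: R = h(x) -> p(x, y), Q = q(u), p(u, v), p(w, v), r(w).
query = {("q", "u"), ("p", "u", "v"), ("p", "w", "v"), ("r", "w")}
piece = {("p", "u", "v"), ("p", "w", "v")}
head_part = {("p", "x", "y")}
body = {("h", "x")}
sigma = {"x": "u", "w": "u", "y": "v"}

rewriting = direct_rewriting(query, piece, head_part, body, sigma)
```

The resulting set contains exactly the atoms h(u), q(u) and r(u), as stated in the example.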

We now define some important kinds of rule sets (see e.g., [Mugnier, 2011]
for an overview). A model $M$ of a KB $\mathcal{K}$ is called universal if for
any CQ $Q$, $M$ is a model of $Q$ iff $\mathcal{K} \models Q$. A rule set
$\mathcal{R}$ is a finite expansion set (fes) if any KB
$(\mathcal{F}, \mathcal{R})$ has a finite universal model. It is a
bounded-treewidth set (bts) if any KB $(\mathcal{F}, \mathcal{R})$ has a
(possibly infinite) universal model of bounded treewidth. It is a finite
unification set (fus) if, for any CQ $Q$, there is a finite set $S$ of
rewritings of $Q$ w.r.t. $\mathcal{R}$ such that for any fact base
$\mathcal{F}$, we have $\mathcal{F}, \mathcal{R} \models Q$ iff there is
$Q' \in S$ such that $\mathcal{F} \models Q'$.

A Datalog rule has no existential variables, hence Datalog rule sets are fes.
Other kinds of fes rules are considered in the next section. A rule
$B \rightarrow H$ is guarded if there is an atom in $B$ that contains all the
variables occurring in $B$. Guarded rules are bts. A linear rule has a body
composed of a single atom and does not contain any constant. Linear rules are
guarded, hence bts; moreover, they are fus.

As a special case of Datalog rules, we have transitivity rules, of the form
$p(x,y) \wedge p(y,z) \rightarrow p(x,z)$, which are not fus. A predicate is
called transitive if it appears in a transitivity rule. If $\mathcal{C}$ is a
class of rule sets, $\mathcal{C}$+trans denotes the class obtained by adding
transitivity rules to rule sets from $\mathcal{C}$.
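Since transitivity rules are Datalog rules, they can always be applied to a fact base until saturation. The following sketch illustrates this with a naive fixpoint loop; the tuple encoding of facts and the function name are assumptions of this illustration, and no attempt is made at efficiency.

```python
def saturate_transitive(facts, transitive_predicates):
    """Naively apply p(x, y) & p(y, z) -> p(x, z) until no new fact appears.

    Facts are tuples (predicate, constant, ...); only binary facts over a
    transitive predicate take part in the rule applications.
    """
    closed = set(facts)
    changed = True
    while changed:
        changed = False
        binary = [f for f in closed if len(f) == 3 and f[0] in transitive_predicates]
        for (p, x, y) in binary:
            for (q, y2, z) in binary:
                if p == q and y == y2 and (p, x, z) not in closed:
                    closed.add((p, x, z))
                    changed = True
    return closed


facts = {
    ("partOf", "a", "b"),
    ("partOf", "b", "c"),
    ("partOf", "c", "d"),
    ("organ", "a"),
}
saturated = saturate_transitive(facts, {"partOf"})
```

On this input the loop adds partOf(a, c), partOf(b, d) and partOf(a, d) and then stops, which reflects the fact that Datalog rule sets are fes.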











3 Combining fes / fus and Transitivity

A large hierarchy of fes classes is known (see e.g., [Cuenca Grau et al.,
2013] for an overview). Besides Datalog, the simplest classes are
weakly-acyclic (wa) sets, which prevent cyclic propagation of existential
variables along predicate positions, and aGRD (acyclic Graph of Rule
Dependencies) sets, which prevent cyclic dependencies between rules. Datalog
is generalized by wa, while wa and aGRD are incomparable. Some classes
generalize wa by a finer analysis of variable propagation (up to
super-weakly acyclic (swa) sets). Most other fes classes generalize both wa
and aGRD.

We show that aGRD+trans is undecidable even for atomic CQs. Since aGRD is
both fes and fus, this negative result also transfers to fes+trans and
fus+trans.

Theorem 1 Atomic CQ entailment over aGRD+trans KBs is undecidable, even with
a single transitivity rule.

Proof: The proof is by reduction from atomic CQ entailment with general
existential rules (which is known to be undecidable). Let $\mathcal{R}$ be a
set of rules. We first translate $\mathcal{R}$ into an aGRD set of rules
$\mathcal{R}^a$. We consider the following new predicates: $p$ (which will be
the transitive predicate) and, for each rule $R_i \in \mathcal{R}$, predicates
$a_i$ and $b_i$. Each rule $R_i = B_i \rightarrow H_i$ is translated into the
two following rules:

• $R_i^1 = B_i \rightarrow a_i(\vec{x}, z_1) \wedge p(z_1, z_2) \wedge p(z_2, z_3) \wedge b_i(z_3)$

• $R_i^2 = a_i(\vec{x}, z_1) \wedge p(z_1, z_2) \wedge b_i(z_2) \rightarrow H_i$

where $z_1$, $z_2$ and $z_3$ are existential variables and $\vec{x}$ are the
variables in $B_i$.

Let $\mathcal{R}^a = \{R_i^1, R_i^2 \mid R_i \in \mathcal{R}\}$, and let
GRD($\mathcal{R}^a$) be the graph of rule dependencies of $\mathcal{R}^a$,
defined as follows: the nodes of GRD($\mathcal{R}^a$) are in bijection with
$\mathcal{R}^a$, and there is an edge from a node $R_1$ to a node $R_2$ if the
rule $R_2$ depends on the rule $R_1$, i.e., if there is a piece-unifier of the
body of $R_2$ (seen as a CQ) with the head of $R_1$.

We check that for any $R_i \in \mathcal{R}$, $R_i^1$ has no outgoing edge and
$R_i^2$ has no incoming edge (indeed the $z_j$ are existential variables).
Hence, in GRD($\mathcal{R}^a$) all (directed) paths are of length less than or
equal to one. It follows that GRD($\mathcal{R}^a$) has no cycle, i.e.,
$\mathcal{R}^a$ is aGRD.

Let $R^*$ be the rule stating that $p$ is transitive. Let
$\mathcal{R}' = \mathcal{R}^a \cup \{R^*\}$. The idea is that $R^*$ allows one
to “connect” rules in $\mathcal{R}^a$ that correspond to the same rule in
$\mathcal{R}$. For any fact base $\mathcal{F}$ (on the original vocabulary),
for any sequence of rule applications from $\mathcal{F}$ using rules in
$\mathcal{R}$, one can build a sequence of rule applications from
$\mathcal{F}$ using rules from $\mathcal{R}'$, and reciprocally, such that
both sequences produce the same fact base (restricted to atoms on the original
vocabulary). Hence, for any $\mathcal{F}$ and $Q$ (on the original
vocabulary), we have that $\mathcal{F}, \mathcal{R} \models Q$ iff
$\mathcal{F}, \mathcal{R}' \models Q$. □
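The rule translation used in the proof is purely syntactic and can be sketched as follows. The encoding is an assumption of this illustration: atoms are tuples `(predicate, term, ...)`, terms starting with `?` are variables, the variable names `?_z1` to `?_z3` are assumed not to occur in the input rule, and `p`, `a<i>`, `b<i>` are assumed to be fresh predicate names.

```python
def translate_rule(index, body, head):
    """Split rule R_i = body -> head into the pair (R_i^1, R_i^2) of the proof.

    Each returned rule is a pair (body_atoms, head_atoms).
    """
    # The variables of B_i, in a fixed order, become the first arguments of a_i.
    body_vars = sorted({t for atom in body for t in atom[1:] if t.startswith("?")})
    a, b = f"a{index}", f"b{index}"
    z1, z2, z3 = "?_z1", "?_z2", "?_z3"
    # R_i^1 = B_i -> a_i(x, z1), p(z1, z2), p(z2, z3), b_i(z3)
    rule1 = (list(body), [(a, *body_vars, z1), ("p", z1, z2), ("p", z2, z3), (b, z3)])
    # R_i^2 = a_i(x, z1), p(z1, z2), b_i(z2) -> H_i
    rule2 = ([(a, *body_vars, z1), ("p", z1, z2), (b, z2)], list(head))
    return rule1, rule2


# Hypothetical input rule q(x, y) -> r(y, w).
r1, r2 = translate_rule(0, [("q", "?x", "?y")], [("r", "?y", "?w")])
```

The body of the second rule needs an atom p(z1, z2) whose second argument also satisfies b_i, whereas the head of the first rule only provides p(z1, z2) and p(z2, z3) with b_i(z3). The missing atom p(z1, z3) is exactly what the transitivity rule supplies.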

Corollary 1 Atomic CQ entailment over fus+trans or fes+trans KBs is
undecidable.

Most known fes classes that do not generalize aGRD range between Datalog and
swa (inclusive). It can be easily checked that any swa set of rules remains
swa when transitivity rules are added (and this is actually true for all known
classes between Datalog and swa).

Proposition 1 The classes swa and swa+trans coincide. Hence, swa+trans is
decidable.

Proof: It suffices to note that the addition of transitivity rules does not
create new edges in the ‘SWA position graph’ from [Cuenca Grau et al.,
2013]. □


The only remaining fes class that is not covered by the preceding results,
namely Model Summarizing Acyclicity (msa) from [Cuenca Grau et al., 2013], can
be shown to be incompatible with transitivity rules:

Proposition 2 Atomic CQ entailment over msa+trans KBs is undecidable.


It follows that the effect of transitivity on the currently known fes
landscape is now quite clear, which is not the case for fus classes. In the
following, we focus on a well-known fus class, namely linear rules. We show by
means of a query rewriting procedure that query entailment over linear+trans
KBs is decidable in the case of atomic CQs, as well as for general CQs if we
place a minor safety condition on the rule set. Such an outcome was not
obvious in the light of existing results. Indeed, atomic CQ entailment over
guarded+trans rules was recently shown undecidable, even when restricted to
rule sets that belong to the two-variable fragment, use only unary and binary
predicates, and contain only two transitive predicates [Gottlob et al., 2013].
Moreover, inclusion dependencies (a subclass of linear rules) and functional
dependencies (a kind of rule known to destroy tree structures, as do
transitivity rules) are known to be incompatible [Chandra and Vardi, 1985].


4 Linear Rules and Transitivity 

To obtain finite representations of sets of rewritings involving transitive
predicates, we define a framework based on the notion of pattern.

4.1 Framework 

To each transitive predicate we assign a pattern name. Each pattern name has
an associated pattern definition $P := a_1 \mid \ldots \mid a_k$, where each
$a_i$ is an atom that contains the special variables #1 and #2. A pattern is
either a standard pattern $P[t_1, t_2]$ or a repeatable pattern
$P^+[t_1, t_2]$, where $P$ is a pattern name and $t_1$ and $t_2$ are terms. A
union of patterned conjunctive queries (UPCQ) is a pair
$(\mathcal{Q}, \mathbb{P})$, where $\mathcal{Q}$ is a disjunction of
conjunctions of atoms and patterns, and $\mathbb{P}$ is a set of pattern
definitions that gives a unique definition to each pattern name occurring in
$\mathcal{Q}$. A patterned conjunctive query (PCQ) is a UPCQ without
disjunction. For the sake of simplicity, we will often denote a (U)PCQ by its
first component $\mathcal{Q}$, leaving the pattern definitions implicit.

An instantiation $T$ of a UPCQ $(\mathcal{Q}, \mathbb{P})$ is a node-labelled
tree that satisfies the following conditions:

• the root of $T$ is labelled by $Q \in \mathcal{Q}$;

• the children of the root are labelled by the patterns and atoms occurring
in $Q$;

• each node that is labelled by a repeatable pattern $P^+[t_1, t_2]$ may be
expanded into $k \geq 1$ children labelled respectively by
$P[t_1, x_1], P[x_1, x_2], \ldots, P[x_{k-1}, t_2]$, where the $x_i$ are fresh
variables;

• each node labelled by a standard pattern $P[t_1, t_2]$ may be expanded into
a single child whose label is obtained from an atom $a$ in the pattern
definition of $P$ in $\mathbb{P}$ by substituting #1 (resp. #2) by $t_1$
(resp. $t_2$), and freshly renaming the other variables.

Figure 1: Instantiations of a PCQ

For brevity, we will often refer to nodes in an instantiation using their
labels.

The instance associated with an instantiation is the PCQ obtained by taking
the conjunction of the labels of its leaves. An instance of a UPCQ is an
instance associated with one of its instantiations. An instance is called full
if it does not contain any pattern, and we denote by
full$(\mathcal{Q}, \mathbb{P})$ the set of full instances of
$(\mathcal{Q}, \mathbb{P})$.
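The two expansion steps of an instantiation can be sketched in Python. This is an illustration only: patterns and atoms are encoded as tuples, `fresh_variables` is a hypothetical helper that supplies fresh variable names, and all terms of a definition atom other than #1 and #2 are assumed to be variables.

```python
from itertools import count


def fresh_variables(prefix="x"):
    """Hypothetical helper: yields prefix1, prefix2, ... as fresh variable names."""
    return (f"{prefix}{i}" for i in count(1))


def expand_repeatable(name, t1, t2, k, fresh):
    """Expand P+[t1, t2] into the chain P[t1, x1], P[x1, x2], ..., P[x_{k-1}, t2]."""
    if k < 1:
        raise ValueError("k must be at least 1")
    inner = [next(fresh) for _ in range(k - 1)]
    points = [t1, *inner, t2]
    return [(name, points[i], points[i + 1]) for i in range(k)]


def expand_standard(atom, t1, t2, fresh):
    """Expand P[t1, t2] with a definition atom: #1 -> t1, #2 -> t2, rest renamed."""
    predicate, *terms = atom
    renaming = {"#1": t1, "#2": t2}
    expanded = []
    for term in terms:
        if term not in renaming:
            renaming[term] = next(fresh)  # fresh copy of a non-special variable
        expanded.append(renaming[term])
    return (predicate, *expanded)


chain = expand_repeatable("P2", "z", "b", 2, fresh_variables("x"))
atom = expand_standard(("s2", "#2", "y", "#1"), "z", "x1", fresh_variables("y"))
```

Here `chain` consists of the standard patterns P2[z, x1] and P2[x1, b], and expanding P2[z, x1] with the definition atom s2(#2, y, #1) yields s2(x1, y1, z).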

Example 2 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ, where
$\mathcal{Q} = P_1^+[a,z] \wedge P_2^+[z,b] \wedge s_1(a,b)$ and $\mathbb{P}$
contains the pattern definitions:
$P_1 := p_1(\#1,\#2) \mid s_2(\#1,y,\#2)$ and
$P_2 := p_2(\#1,\#2) \mid s_2(\#2,y,\#1)$.

Two instantiations of $\mathcal{Q}$ are displayed in Figure 1. The smaller
instantiation (within the dotted lines) gives rise to the (non-full) instance
$Q_1 = P_1[a,z] \wedge P_2[z,x_1] \wedge P_2[x_1,b] \wedge s_1(a,b)$. By
expanding the three nodes labelled by patterns according to the definitions in
$\mathbb{P}$, we may obtain the larger instantiation (occupying the entire
figure), whose associated instance
$Q_2 = s_2(a,y_0,z) \wedge s_2(x_1,y_1,z) \wedge p_2(x_1,b) \wedge s_1(a,b)$
is a full instance for $(\mathcal{Q}, \mathbb{P})$.

A UPCQ $(\mathcal{Q}, \mathbb{P})$ can be translated into a set of Datalog
rules $\Pi_\mathbb{P}$ and a UCQ $\mathcal{Q}_\Pi$ as follows. For each
definition $P := a_1(\vec{t}_1) \mid \ldots \mid a_k(\vec{t}_k)$ in
$\mathbb{P}$, we create the transitivity rule
$p^+(x,y) \wedge p^+(y,z) \rightarrow p^+(x,z)$ and the rules
$a_i(\vec{t}_i) \rightarrow p^+(\#1,\#2)$ ($1 \leq i \leq k$). The UCQ
$\mathcal{Q}_\Pi$ is obtained from $\mathcal{Q}$ by replacing each repeatable
pattern $P^+[t_1,t_2]$ by the atom $p^+(t_1,t_2)$. Observe that
$\Pi_\mathbb{P}$ is non-recursive except for the transitivity rules. The next
proposition states that $(\Pi_\mathbb{P}, \mathcal{Q}_\Pi)$ can be seen as a
finite representation of the set of full instances of
$(\mathcal{Q}, \mathbb{P})$.
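The translation of pattern definitions into Datalog rules is mechanical, as the following sketch suggests. The dictionary encoding of definitions, the `(body, head)` encoding of rules and the naming of the predicate p+ after the lower-cased pattern name are assumptions made for this illustration.

```python
def datalog_translation(definitions):
    """Map pattern definitions {name: [atoms]} to Datalog rules (body, head)."""
    rules = []
    for name, atoms in definitions.items():
        plus = f"{name.lower()}+"  # predicate p+ associated with pattern name P
        # Transitivity rule p+(x, y), p+(y, z) -> p+(x, z).
        rules.append(([(plus, "x", "y"), (plus, "y", "z")], (plus, "x", "z")))
        # One rule a_i -> p+(#1, #2) per atom of the definition.
        for atom in atoms:
            rules.append(([atom], (plus, "#1", "#2")))
    return rules


# Pattern definitions of Example 2.
definitions = {
    "P1": [("p1", "#1", "#2"), ("s2", "#1", "y", "#2")],
    "P2": [("p2", "#1", "#2"), ("s2", "#2", "y", "#1")],
}
rules = datalog_translation(definitions)
```

Each definition contributes one transitivity rule plus one rule per atom, so the two definitions of Example 2 give six rules in total.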

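Proposition 3 Let $\mathcal{F}$ be a fact base and $(\mathcal{Q}, \mathbb{P})$
be a UPCQ. Then $\mathcal{F}, \Pi_\mathbb{P} \models \mathcal{Q}_\Pi$ iff
$\mathcal{F} \models Q$ for some $Q \in$ full$(\mathcal{Q}, \mathbb{P})$.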

A unifier $\mu = (Q', H, P_\mu)$ of a PCQ is a unifier of one of its (possibly
non-full) instances such that $Q'$ is a set of (usual) atoms. We distinguish
two types of unifiers (internal and external), defined next.

Let $T$ be an instantiation, $Q$ be its associated instance, and
$\mu = (Q', H, P_\mu)$ be a unifier of $Q$. Assume $T$ contains a repeatable
pattern $P^+[t_1, t_2]$ that is expanded into
$P[u_0, u_1], \ldots, P[u_k, u_{k+1}]$, where $u_0 = t_1$ and
$u_{k+1} = t_2$. We call $P[u_i, u_{i+1}]$ relevant for $\mu$ if it is
expanded into an atom from $Q'$. Because we consider only single-piece
unifiers (cf. Sec. 2), it follows that if such relevant patterns exist, they
form a sequence
$P[u_i, u_{i+1}], P[u_{i+1}, u_{i+2}], \ldots, P[u_{j-1}, u_j]$. Terms $u_i$
and $u_j$ are called external to $P^+[t_1, t_2]$ w.r.t. $\mu$; the other terms
occurring in the sequence are called internal. The unifier $\mu$ is said to be
internal if all atoms from $Q'$ are expanded from a single repeatable pattern,
and no external terms are unified together or with an existential variable;
otherwise $\mu$ is called external.

Example 3 Consider $Q_2$ from Example 2 and the rules
$R_1 = s_1(x',y') \rightarrow p_2(x',y')$ and
$R_2 = s_1(x',y') \rightarrow s_2(x',y',z')$. The unifier of $Q_2$ with $R_1$
that unifies $p_2(x_1,b)$ with $p_2(x',y')$ is internal. The unifier of $Q_2$
with $R_2$ that unifies $\{s_2(a,y_0,z), s_2(x_1,y_1,z)\}$ with
$s_2(x',y',z')$ is external because it involves two repeatable patterns.

4.2 Overview of the Algorithm 

Our query rewriting algorithm takes as input a CQ $Q$ and a set of rules
$\mathcal{R} = \mathcal{R}_L \cup \mathcal{R}_T$, with $\mathcal{R}_L$ a set
of linear rules and $\mathcal{R}_T$ a set of transitivity rules, and produces
a finite set of Datalog rules and a (possibly infinite) set of CQs. The main
steps of the algorithm are outlined below.

Step 1 For each predicate $p$ that appears in $\mathcal{R}_T$, create a
pattern definition $P := p(\#1, \#2)$, where $P$ is a fresh pattern name. Call
the resulting set of definitions $\mathbb{P}_0$.

Step 2 Let $\mathcal{R}_L^+$ be the result of considering all of the rule
bodies in $\mathcal{R}_L$ and replacing every body atom $p(t_1, t_2)$ such
that $p$ is a transitive predicate by the repeatable pattern $P^+[t_1, t_2]$.

Step 3 (Internal rewriting) Initialize $\mathbb{P}$ to $\mathbb{P}_0$ and
repeat the following operation until fixpoint: select a pattern definition
$P \in \mathbb{P}$ and a rule $R \in \mathcal{R}_L^+$ and compute the direct
rewriting of $\mathbb{P}$ w.r.t. $P$ and $R$.

Step 4 Replace in $Q$ all atoms $p(t_1, t_2)$ such that $p$ is a transitive
predicate by the repeatable pattern $P^+[t_1, t_2]$, and denote the result by
$Q^+$.

Step 5 (External rewriting) Initialize $\mathcal{Q}$ to $\{Q^+\}$ and repeat
the following operation until fixpoint: choose $Q_i \in \mathcal{Q}$, compute
a direct rewriting of $Q_i$ w.r.t. $\mathbb{P}$ and a rule from
$\mathcal{R}_L^+$, and add the result to $\mathcal{Q}$ (except if it is
isomorphic to some $Q_j \in \mathcal{Q}$).

Step 6 Let $\Pi_\mathbb{P}$ be the Datalog translation of $\mathbb{P}$, and
let $\mathcal{Q}_\Pi$ be the (possibly infinite) set of CQs obtained by
replacing each repeatable pattern $P^+[t_1, t_2]$ in $\mathcal{Q}$ by
$p^+(t_1, t_2)$.

The rewriting process in Step 3 is always guaranteed to terminate, and in
Section 6 we propose a modification to Step 5 that ensures termination and
formulate sufficient conditions that preserve completeness. When
$\mathcal{Q}_\Pi$ is finite (i.e., it is a UCQ), it can be evaluated over the
fact base saturated by $\Pi_\mathbb{P}$, or alternatively, translated into a
set of Datalog rules, which can be combined with $\Pi_\mathbb{P}$ and passed
to a Datalog engine for evaluation. Observe that the construction of
$\Pi_\mathbb{P}$ is query-independent and can be executed as a preprocessing
step.
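The purely syntactic steps of the algorithm (Steps 1, 2 and 4) can be sketched as below; the rewriting steps proper (Steps 3 and 5) are not covered. The naming scheme `P_<predicate>` for fresh pattern names and the tagged-tuple encoding of atoms and repeatable patterns are assumptions of this illustration.

```python
def pattern_name(predicate):
    """Hypothetical naming scheme for the fresh pattern name of a predicate."""
    return f"P_{predicate}"


def initial_definitions(transitive_predicates):
    """Step 1: one definition P := p(#1, #2) per transitive predicate."""
    return {pattern_name(p): [(p, "#1", "#2")] for p in sorted(transitive_predicates)}


def introduce_patterns(atoms, transitive_predicates):
    """Steps 2 and 4: replace p(t1, t2) by the repeatable pattern P+[t1, t2]."""
    result = []
    for atom in atoms:
        if atom[0] in transitive_predicates:
            _, t1, t2 = atom
            result.append(("repeatable", pattern_name(atom[0]), t1, t2))
        else:
            result.append(("atom", *atom))
    return result


transitive = {"partOf"}
definitions_0 = initial_definitions(transitive)
query_plus = introduce_patterns([("partOf", "u", "v"), ("organ", "u")], transitive)
```

The same `introduce_patterns` function serves for rule bodies (Step 2) and for the input query (Step 4), since both steps perform the same replacement.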

5 Rewriting Steps in Detail 

A PCQ that contains a repeatable pattern has an infinite number of instances.
Instead of considering all instances of a PCQ, we consider a finite set of
‘instances of interest’ for a given rule. Such instances will be used for both
the internal and external rewriting steps.

Instances of interest Consider a PCQ $(\mathcal{Q}, \mathbb{P})$ and a rule
$R \in \mathcal{R}_L^+$ with head predicate $p$. The instantiations of
interest of $(\mathcal{Q}, \mathbb{P})$ w.r.t. $R$ are constructed as follows.
For each repeatable pattern $P_i^+[t_1, t_2]$ in $\mathcal{Q}$, let
$a_1^i, \ldots, a_{n_i}^i$ be the atoms in the definition of $P_i$ with
predicate $p$. If $n_i > 0$, then expand $P_i^+[t_1, t_2]$ into $k$ standard
patterns, where $0 < k \leq \min(\mathrm{arity}(p), n_i) + 2$, and expand each
of these standard patterns in turn into some $a_j^i$. An instance of interest
is the instance associated with an instantiation of interest.
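The finiteness of the set of instantiations of interest can be seen by enumerating the candidate expansions of one repeatable pattern. The sketch below reads the bound on k as inclusive, which is an assumption, and the function names and tuple encoding of atoms are hypothetical.

```python
from itertools import product


def max_path_length(arity, n_atoms):
    """Upper bound on k, read here as k <= min(arity(p), n_i) + 2."""
    return min(arity, n_atoms) + 2


def interest_expansions(arity, atoms):
    """Yield every sequence of definition atoms a repeatable pattern may expand to.

    `atoms` are the atoms of the pattern definition whose predicate is the
    head predicate p of the rule; nothing is yielded when there are none.
    """
    if not atoms:
        return
    for k in range(1, max_path_length(arity, len(atoms)) + 1):
        yield from product(atoms, repeat=k)


# Three hypothetical definition atoms over a predicate r of arity 4.
atoms = [
    ("r", "#2", "#1", "x0", "x1"),
    ("r", "#1", "x2", "#2", "x3"),
    ("r", "x4", "x5", "#1", "#2"),
]
expansions = list(interest_expansions(4, atoms))
```

With three atoms of arity 4 the paths have length at most 5, so a single repeatable pattern has finitely many candidate expansions, in contrast to the infinitely many instances of the PCQ.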

Example 4 Reconsider $\mathcal{Q}$, $Q_2$ and $R_2$ from Examples 2 and 3.
$Q_2$ is not an instance of interest of $\mathcal{Q}$ w.r.t. $R_2$ since
$P_2[x_1, b]$ is expanded into $p_2(\#1, \#2)$ whereas the head predicate of
$R_2$ is $s_2$. If we expand $P_2[x_1, b]$ with $s_2(\#2, y, \#1)$ instead, we
obtain the instance of interest
$Q_3 = s_2(a,y_0,z) \wedge s_2(x_1,y_1,z) \wedge s_2(b,y_2,x_1) \wedge s_1(a,b)$.

We next show that the set of unifiers computed on the instances of interest of
a PCQ ‘captures’ the set of unifiers computed on all of its instances.

Proposition 4 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ and
$R \in \mathcal{R}_L^+$. For every instance $Q$ of $(\mathcal{Q}, \mathbb{P})$
and unifier $\mu$ of $Q$ with $R$, there exist an instance of interest $Q'$ of
$(\mathcal{Q}, \mathbb{P})$ w.r.t. $R$ and a unifier $\mu'$ of $Q'$ with $R$
such that $\mu'$ is more general³ than $\mu$.

5.1 Internal Rewriting 

Rewriting w.r.t. internal unifiers is performed ‘inside’ a repeatable pattern,
independently of the other patterns and atoms in the query. We will therefore
handle this kind of rewriting in a query-independent manner by updating the
pattern definitions.

To find all internal unifiers between instances under a repeatable pattern
$P^+[t_1, t_2]$ and a rule head $H = p(\ldots)$, one may think that it is
sufficient to consider each atom $a_i$ in $P$'s definition and check if there
is an internal unifier of $a_i$ with $H$. Indeed, this suffices when
predicates are binary: in an internal unifier, $t_1$ and $t_2$ are unified
with distinct variables, which cannot be existential; thus, the terms in $H$
are frontier variables, and a piece must consist of a single atom. If the
arity of $p$ is greater than 2, the other variables can be existential, so it
may be possible to unify a path of atoms from $P$'s definition, but not a
single such atom (see next example).

Example 5 Let $R = s(x,y) \rightarrow r(z_1, x, z_2, y)$ and
$P := r(\#2,\#1,x_0,x_1) \mid r(\#1,x_2,\#2,x_3) \mid r(x_4,x_5,\#1,\#2)$.
There is no internal unifier of an atom in $P$'s definition with
$H = r(z_1, x, z_2, y)$. However, if we expand $P^+[t_1, t_2]$ into a path
$P[t_1, y_0]\, P[y_0, y_1]\, P[y_1, t_2]$, then expand the $i$th pattern of
this path into the $i$th atom in $P$'s definition, the resulting instance can
be unified with $H$ by an internal unifier (with the partition
$\{\{z_1, y_0, x_4\}, \{x, t_1, x_2, x_5\}, \{z_2, x_0, y_1\}, \{y, x_1, x_3, t_2\}\}$).

³ Consider unifiers $\mu = (Q, H, P_\mu)$ and $\mu' = (Q', H, P_{\mu'})$, and
let $\sigma$ (resp. $\sigma'$) be a substitution associated with $P_\mu$
(resp. $P_{\mu'}$). We say that $\mu'$ is more general than $\mu$ if there is
a substitution $h$ from $\sigma'(Q')$ to $\sigma(Q)$ such that
$h(\sigma'(Q')) \subseteq \sigma(Q)$ (i.e., $h$ is a homomorphism from
$\sigma'(Q')$ to $\sigma(Q)$), and for all terms $x$ and $y$ in $Q' \cup H$,
if $\sigma'(x) = \sigma'(y)$ then $\sigma(h(x)) = \sigma(h(y))$.

Fortunately, we can bound the length of paths to be considered using both the
arity of $p$ and the number of atoms with predicate $p$ in $P$'s definition,
allowing us to use the instances of interest introduced earlier.

A direct rewriting $\mathbb{P}'$ of a set of pattern definitions $\mathbb{P}$
w.r.t. a pattern name $P$ and a rule
$R = B \rightarrow H \in \mathcal{R}_L^+$ is the set of pattern definitions
obtained from $\mathbb{P}$ by updating $P$'s definition as follows. We
consider the PCQ $(\mathcal{Q} = P^+[x,y], \mathbb{P})$. We select an instance
of interest $Q$ of $\mathcal{Q}$ w.r.t. $R$, an internal unifier $\mu$ of $Q$
with $H$, and a substitution $\sigma$ associated with $\mu$ that preserves the
external terms. Let $B'$ be obtained from $\sigma(B)$ by substituting the
first (resp. second) external term by #1 (resp. #2). If $B'$ is an atom, we
add it to $P$'s definition. Otherwise, $B'$ is a repeatable pattern of the
form $S^+[\#1, \#2]$ or $S^+[\#2, \#1]$. Let $f$ be a bijection on
$\{\#1, \#2\}$: if $B'$ is of the form $S^+[\#1, \#2]$, $f$ is the identity,
otherwise $f$ permutes #1 and #2. For all $s_i$ in the definition of $S$, we
add $f(s_i)$ to $P$'s definition.

Note that the addition of an atom to a pattern definition is up to isomorphism
(with #1 and #2 treated as distinguished variables, i.e., #1 and #2 are mapped
to themselves).

Example 6 Reconsider $R$, $\mu$, and the definition of $P$ from Example 5.
Performing a direct rewriting w.r.t. $P$ using $R$ and $\mu$ results in adding
the atom $s(\#1, \#2)$ to $P$'s definition.

Proposition 5 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ where $P^+[t_1, t_2]$
occurs and $R \in \mathcal{R}_L^+$. For any instance $Q$ of
$(\mathcal{Q}, \mathbb{P})$, any classical direct rewriting $Q'$ of $Q$ with
$R$ w.r.t. a unifier internal to $P^+[t_1, t_2]$, and any
$Q'' \in$ full$(Q', \mathbb{P})$, there exists a direct rewriting
$\mathbb{P}'$ of $\mathbb{P}$ w.r.t. $P$ and $R$ such that
$(\mathcal{Q}, \mathbb{P}')$ has a full instance that is isomorphic to $Q''$.

5.2 External Rewriting 

Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ, $R \in \mathcal{R}_L^+$, $T$ be an
instantiation of interest of $(\mathcal{Q}, \mathbb{P})$ w.r.t. $R$, $Q$ be
the instance associated with $T$, and $\mu = (Q', H, P_\mu)$ be an external
unifier of $Q$ with $R$. From this, several direct rewritings of $\mathcal{Q}$
w.r.t. $\mathbb{P}$ and $R$ can be built. First, we mark all leaves in $T$
that either have the root as parent or are labelled by an atom in $Q'$, and we
restrict $T$ to branches leading to a marked leaf. Then, we consider each
instantiation $T_i$ that can be obtained from $T$ as follows. Replace each
repeatable pattern $P^+[t_1, t_2]$ that has $k > 0$ children in $T$ by one of
the following:

(i) $P^+[t_1, x_1] \wedge X[x_1, x_2] \wedge P^+[x_2, t_2]$,

(ii) $P^+[t_1, x_1] \wedge X[x_1, t_2]$,

(iii) $X[t_1, x_2] \wedge P^+[x_2, t_2]$,

(iv) $X[t_1, t_2]$,

where $X[v_1, v_2]$ is a sequence
$P[v_1, y_1], P[y_1, y_2], \ldots, P[y_{k-1}, v_2]$. Let $Q_i$ be the instance
associated with $T_i$.
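The four candidate replacements of a repeatable pattern can be generated mechanically. The following sketch assumes a tagged-tuple encoding of patterns and assumes that the variable names `x1`, `x2` and `y1`, `y2`, ... are fresh, which a full implementation would have to guarantee.

```python
def chain(name, v1, v2, k, prefix):
    """X[v1, v2]: the standard patterns P[v1, y1], ..., P[y_{k-1}, v2]."""
    points = [v1, *(f"{prefix}{i}" for i in range(1, k)), v2]
    return [("standard", name, points[i], points[i + 1]) for i in range(k)]


def replacement_cases(name, t1, t2, k):
    """Return the four candidate replacements of P+[t1, t2] with k > 0 children."""

    def plus(s, t):
        return ("repeatable", name, s, t)

    return {
        "i": [plus(t1, "x1"), *chain(name, "x1", "x2", k, "y"), plus("x2", t2)],
        "ii": [plus(t1, "x1"), *chain(name, "x1", t2, k, "y")],
        "iii": [*chain(name, t1, "x2", k, "y"), plus("x2", t2)],
        "iv": chain(name, t1, t2, k, "y"),
    }


cases = replacement_cases("P1", "a", "z", 1)
```

Case (iv) keeps only the k standard patterns, whereas cases (i) to (iii) retain a repeatable pattern on one or both sides, so that the atoms taking part in the unifier can sit anywhere along the original path.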

If $P[x', y']$ in $T$ has child $a(\vec{t})$, expand in $T_i$ the
corresponding $P[x, y]$ into $a(\rho(\vec{t}))$ where
$\rho = \{x' \mapsto x, y' \mapsto y\}$. If
$\mu' = (\rho(Q'), H, \rho(P_\mu))$ is still a unifier of $Q_i$ with $H$, we
say that $Q_i$ is a minimally-unifiable instance of $\mathcal{Q}$ w.r.t.
$\mu$. In this case, $Q_i' = \mu'(Q_i) \setminus \mu'(H) \cup \mu'(B)$ is a
direct rewriting of $\mathcal{Q}$ w.r.t. $\mathbb{P}$ and $R$.



Example 7 Reconsider $Q_3$ and $R_2$, and let
$\mu = (\{s_2(a,y_0,z), s_2(x_1,y_1,z)\}, H_2, \{\{a,x_1,x'\}, \{y_0,y_1,y'\}, \{z,z'\}\})$.
First, we consider the instantiation that generated $Q_3$, and we remove the
node labelled $P_2[x_1, b]$ and its child $s_2(b, y_2, x_1)$, since the latter
atom is not involved in $\mu$. Next, we replace the repeatable pattern
$P_1^+[a, z]$ (resp. $P_2^+[z, b]$) using one of the four cases detailed
above, and we check whether $\mu'$ (obtained from $\mu$) is still a unifier.
We obtain in this manner the following minimally-unifiable instances:
$Q_1 = P_1^+[a,x_2] \wedge s_2(x_2,y_0,z) \wedge s_2(x_1,y_1,z) \wedge P_2^+[x_1,b] \wedge s_1(a,b)$,
and
$Q_2 = s_2(a,y_0,z) \wedge s_2(x_1,y_1,z) \wedge P_2^+[x_1,b] \wedge s_1(a,b)$.
Finally, we rewrite $Q_1$ and $Q_2$ into:
$Q_1' = P_1^+[a,x'] \wedge s_1(x',y') \wedge P_2^+[x',b] \wedge s_1(a,b)$ and
$Q_2' = s_1(a,y') \wedge P_2^+[a,b] \wedge s_1(a,b)$.

Proposition 6 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ and
$R \in \mathcal{R}_L^+$. For every $Q \in$ full$(\mathcal{Q}, \mathbb{P})$ and
every classical direct rewriting $Q'$ of $Q$ with $R$ w.r.t. an external
unifier, there is a direct rewriting $\mathcal{Q}'$ of $\mathcal{Q}$ w.r.t.
$\mathbb{P}$ and $R$ that has an instance isomorphic to $Q'$.

6 Termination and Correctness 

To establish the correctness of the query rewriting algorithm, we utilize
Propositions 3, 5 and 6.

Theorem 2 Let $Q$ be a CQ, $(\mathcal{F}, \mathcal{R})$ be a linear+trans KB,
and $(\Pi_\mathbb{P}, \mathcal{Q}_\Pi)$ be the (possibly infinite) output of
the algorithm. Then: $\mathcal{F}, \mathcal{R} \models Q$ iff
$\mathcal{F}, \Pi_\mathbb{P} \models Q'$ for some $Q' \in \mathcal{Q}_\Pi$.

Regarding termination, we observe that Step 3 (internal rewriting) must halt
since every direct rewriting step adds a new atom (using a predicate from
$\mathcal{R}_L^+$) to a pattern definition, and there are finitely many such
atoms, up to isomorphism.

By contrast. Step 5 (external rewriting) need not halt, as 
the rewritings may grow unboundedly in size. Thus, to ensure 
termination, we will modify Step 5 to exclude direct rewrit¬ 
ings that increase rewriting size. Specifically, we identify the 
following ‘problematic’ minimally-unifiable instances; 

• $Q'$ is composed of atoms expanded from a single pattern
$P^+[t_1, t_2]$, $\mu'(t_1) = \mu'(t_2)$, and $P^+[t_1, t_2]$ is replaced
as in case (i), (ii) or (iii).

• $Q'$ is obtained from the expansion of repeatable patterns,
a term $t$ of $\mathcal{Q}$ is unified with an existential variable of the
head of the rule, $t$ appears only in repeatable patterns of
form $P_i^+[t_i, t]$ (resp. $P_i^+[t, t_i]$), and all these repeatable
patterns are rewritten as in case (ii) $P_i^+[t_i, t'] \wedge X[t', t]$
(resp. as in case (iii) $X[t, t'] \wedge P_i^+[t', t_i]$).

We will call a direct rewriting excluded if it is based on such 
a minimally-unifiable instance; otherwise, it is non-excluded. 

Example 8 The rewriting $Q'_1$ from Example 7 is excluded because
it is obtained from the minimally-unifiable instance $Q_1$
in which the repeatable pattern $P_2^+[a, z]$ is expanded as in
case (ii) and $P_2^+[z, b]$ as in case (iii), and $z$ is unified with
the existential variable $z'$.

Proposition 7 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ and $R \in \mathcal{R}_L^+$. If $\mathcal{Q}'$ is a
non-excluded direct rewriting of $\mathcal{Q}$ with $R$, then $|\mathcal{Q}'| \leq |\mathcal{Q}|$.

Let us consider the 'modified query rewriting algorithm'
that is obtained by only performing non-excluded direct
rewritings in Step 5. This modification ensures termination
but may compromise completeness. However, we can show that
the modified algorithm is complete in the following key cases: 
when the CQ is atomic, when there is no specialization of a 
transitive predicate, or when all predicates have arity at most 
two. By further analyzing the latter case, we can formulate a 
safety condition, defined next, that guarantees completeness 
for a much wider class of rule sets. 

Safe rule sets We begin by defining a specialization relationship
between predicates. A predicate $q$ is a direct specialization
of a binary predicate $p$ on positions $\{\vec{i}, \vec{j}\}$ ($\vec{i} \neq \emptyset$,
$\vec{j} \neq \emptyset$) if there is a rule of the form $q(\vec{u}) \rightarrow p(x, y)$ such that $\vec{i}$
(resp. $\vec{j}$) contains those positions of $\vec{u}$ that contain the term $x$
(resp. $y$). It is a specialization of $p$ on positions $\{\vec{i}, \vec{j}\}$ if (a) it
is a direct specialization of $p$ on positions $\{\vec{i}, \vec{j}\}$, or (b) there
is a rule of the form $q(\vec{u}) \rightarrow r(\vec{v})$ such that $r$ is a specialization
of $p$ on positions $\{\vec{k}, \vec{l}\}$ and the terms occurring in
positions $\{\vec{k}, \vec{l}\}$ of $\vec{v}$ occur in positions $\{\vec{i}, \vec{j}\}$ of $\vec{u}$ with $\vec{i} \neq \emptyset$
and $\vec{j} \neq \emptyset$. We say that $q$ is a pseudo-transitive predicate if it
is a specialization of at least one transitive predicate.

We call a linear+trans rule set safe if it satisfies the following
safety condition: for every pseudo-transitive predicate $q$,
there exists a pair of positions $\{i, j\}$ with $i \neq j$ such that
for all transitive predicates $p$ of which $q$ is a specialization on
positions $\{\vec{i}, \vec{j}\}$, either $i \in \vec{i}$ and $j \in \vec{j}$, or $i \in \vec{j}$ and $j \in \vec{i}$.

Note that if we consider binary predicates, the safety condition
is always fulfilled. Then, specializations correspond
exactly to the subroles considered in DLs.

Example 9 Let $R_1 = s_1(x,x,y) \rightarrow p_1(x,y)$, $R_2 =
s_2(x,y,z) \rightarrow p_2(x,y)$, $R_3 = s_1(x,y,z) \rightarrow s_2(z,x,y)$, and
$p_1$ and $p_2$ be two transitive predicates.

The following specializations have to be considered: $s_1$ is
a direct specialization of $p_1$ on positions $\{\{1,2\}, \{3\}\}$, $s_2$
is a direct specialization of $p_2$ on positions $\{\{1\},\{2\}\}$, $s_1$
is a specialization of $p_2$ on positions $\{\{3\},\{1\}\}$. We then
have two pseudo-transitive predicates: $s_1$ and $s_2$. By choosing
the pair $\{1, 3\}$ for $s_1$ and $\{1, 2\}$ for $s_2$, we observe that
$\{R_1, R_2, R_3\}$ satisfies the safety condition.

If we replace $R_3$ by $R_4 = s_1(x,y,z) \rightarrow s_2(x,y,z)$,
$s_1$ is a specialization of $p_2$ on positions $\{\{1\},\{2\}\}$, and
$\{R_1, R_2, R_4\}$ is not safe.
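
The safety condition can be checked mechanically. The following Python sketch is only an illustration under stated assumptions, not the authors' procedure: rules are restricted to one body atom and one head atom, written as hypothetical tuples `(body_pred, body_terms, head_pred, head_terms)`; positions are 1-based; and for case (b) the positions of $\vec{v}$ are pulled back to every position of $\vec{u}$ holding one of the corresponding terms, which is one reading of the definition. The names `specializations` and `is_safe` are ours.

```python
from itertools import combinations


def _positions(terms, wanted):
    """1-based positions of `terms` holding one of the terms in `wanted`."""
    return frozenset(i + 1 for i, t in enumerate(terms) if t in wanted)


def specializations(rules, transitive):
    """Map each predicate q to the set of triples (p, I, J) such that q is a
    specialization of the transitive predicate p on positions (I, J)."""
    spec = {}
    changed = True
    while changed:  # naive fixpoint over the rule set
        changed = False
        for q, u, r, v in rules:
            found = set()
            if r in transitive and len(v) == 2:
                # direct case: positions of u holding x (resp. y)
                found.add((r, _positions(u, {v[0]}), _positions(u, {v[1]})))
            for p, K, L in spec.get(r, ()):
                # inherited case: pull positions K, L of v back into u
                found.add((p,
                           _positions(u, {v[k - 1] for k in K}),
                           _positions(u, {v[l - 1] for l in L})))
            for p, I, J in found:
                if I and J and (p, I, J) not in spec.setdefault(q, set()):
                    spec[q].add((p, I, J))
                    changed = True
    return spec


def is_safe(rules, transitive):
    """Check the safety condition for every pseudo-transitive predicate."""
    arity = {}
    for q, u, r, v in rules:
        arity[q], arity[r] = len(u), len(v)
    for q, triples in specializations(rules, transitive).items():
        pairs = combinations(range(1, arity[q] + 1), 2)
        if not any(all((i in I and j in J) or (i in J and j in I)
                       for _, I, J in triples)
                   for i, j in pairs):
            return False
    return True


# The rules of Example 9.
R1 = ("s1", ("x", "x", "y"), "p1", ("x", "y"))
R2 = ("s2", ("x", "y", "z"), "p2", ("x", "y"))
R3 = ("s1", ("x", "y", "z"), "s2", ("z", "x", "y"))
R4 = ("s1", ("x", "y", "z"), "s2", ("x", "y", "z"))
TRANSITIVE = {"p1", "p2"}
```

On the rules of Example 9, this sketch reports the rule set with $R_3$ as safe and the rule set with $R_4$ as unsafe, in line with the discussion above.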

Theorem 3 The modified query rewriting algorithm halts.
Moreover, Theorem 2 (soundness and completeness) holds for
the modified algorithm if either the input CQ is atomic, or the
input rule set is safe.

7 Complexity

A careful analysis of our query rewriting algorithm allows 
us to provide bounds on the worst-case complexity of atomic
CQ entailment over linear+trans KBs, and of general CQ
entailment over safe linear+trans KBs. As usual, we consider
two complexity measures: combined complexity (measured 
in terms of the size of the whole input), and data complexity 
(measured in terms of the size of the fact base). The latter is 
often considered more relevant since the fact base is typically 
significantly larger than the rest of the input. 


With regards to data complexity, we show completeness for 
NL (non-deterministic logarithmic space), which is the same 
complexity as in the presence of transitivity rules alone. 

Theorem 4 Both (i) atomic CQ entailment over linear+trans 
KBs and (ii) CQ entailment over safe linear+trans KBs are 
NL-complete in data complexity.

Regarding combined complexity, we show that both problems
are in ExpTime, and prove that atomic CQ entailment
over linear+trans KBs is ExpTime-complete. Hence, the
addition of transitivity rules increases the complexity of query
entailment for atomic queries. The precise combined
complexity of general CQ entailment over safe linear+trans KBs
remains an open issue.

Theorem 5 Both (i) atomic CQ entailment over linear+trans 
KBs and (ii) CQ entailment over safe linear+trans KBs are in 
ExpTime in combined complexity. Furthermore, atomic CQ 
entailment over linear+trans KBs is ExpTime-hard in
combined complexity.

8 Conclusion 

In this paper, we made some steps towards a better
understanding of the interaction between transitivity and
decidable classes of existential rules. We obtained an
undecidability result for aGRD+trans, hence for fes+trans and
fus+trans. More positively, we established decidability (with
the lowest possible data complexity) of atomic CQ entailment
over linear+trans KBs and general CQ entailment for safe
linear+trans rule sets. The safety condition was introduced
to ensure termination of the rewriting mechanism when
predicates of arity more than two are considered (rule sets which
use only unary and binary predicates are trivially safe). We
believe the condition can be removed with a much more
involved termination proof.

In future work, we plan to explore the effect of
transitivity on fus rule classes that are incomparable with
linear rules, namely domain-restricted and sticky rule sets
[Baget et al., 2011a; Calì et al., 2010].
Appendix

Notations In the following we use several notations that
were not included in the main paper for space restrictions. Let
$Q_1$ and $Q_2$ be two CQs. If there is a homomorphism from $Q_1$
to $Q_2$, we say that $Q_1$ is more general than $Q_2$, and we write
$Q_1 \geq Q_2$. This definition is naturally extended to (U)PCQs:
let $(\mathcal{Q}_1, \mathbb{P}_1)$ and $(\mathcal{Q}_2, \mathbb{P}_2)$ be two (U)PCQs. If for any full
instance $Q_2$ of $(\mathcal{Q}_2, \mathbb{P}_2)$ there is a full instance $Q_1$ of $(\mathcal{Q}_1, \mathbb{P}_1)$
such that $Q_1 \geq Q_2$, we say that $(\mathcal{Q}_1, \mathbb{P}_1)$ is more general
than $(\mathcal{Q}_2, \mathbb{P}_2)$ and we write $(\mathcal{Q}_1, \mathbb{P}_1) \geq (\mathcal{Q}_2, \mathbb{P}_2)$.
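
To make the 'more general than' relation between CQs concrete, here is a minimal Python sketch of a brute-force homomorphism test. It is an illustration only: the encoding of atoms as tuples `(predicate, term, ...)`, the convention that variables are strings starting with `?`, and the function name `homomorphism` are all assumptions of this sketch, not notation from the paper.

```python
def homomorphism(q1, q2, is_var=lambda t: t.startswith("?")):
    """Return a substitution h on the variables of q1 such that h(q1) is a
    subset of q2, or None if no such substitution exists.
    Constants of q1 must be mapped to themselves."""
    atoms2 = list(q2)

    def extend(i, h):
        if i == len(q1):
            return h
        pred, *args = q1[i]
        for cand in atoms2:
            if cand[0] != pred or len(cand) != len(q1[i]):
                continue
            h2 = dict(h)
            ok = True
            for s, t in zip(args, cand[1:]):
                if is_var(s):
                    # bind s to t, or check the existing binding
                    if h2.setdefault(s, t) != t:
                        ok = False
                        break
                elif s != t:
                    ok = False
                    break
            if ok:
                result = extend(i + 1, h2)
                if result is not None:
                    return result
        return None

    return extend(0, {})


# Q1 = p(x, y) is more general than Q2 = p(a, z) and s(z), but not conversely.
Q1 = [("p", "?x", "?y")]
Q2 = [("p", "a", "?z"), ("s", "?z")]
print(homomorphism(Q1, Q2) is not None)  # Q1 >= Q2 holds
print(homomorphism(Q2, Q1) is not None)  # Q2 >= Q1 does not hold
```

The search is exponential in the worst case, which is expected since CQ homomorphism is NP-complete; the sketch is meant for small examples only.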

We recall from the body of the paper that a unifier $\mu' =
(Q', H, P_{u'})$ is more general than $\mu = (Q, H, P_u)$ if there is
a substitution $h$ from $\sigma_{\mu'}(Q')$ to $\sigma_\mu(Q)$ such that $h(\sigma_{\mu'}(Q')) \subseteq
\sigma_\mu(Q)$ (i.e., $h$ is a homomorphism from $\sigma_{\mu'}(Q')$ to $\sigma_\mu(Q)$), and
for all terms $x$ and $y$ in $Q' \cup H$, if $\sigma_{\mu'}(x) = \sigma_{\mu'}(y)$ then
$\sigma_\mu(h(x)) = \sigma_\mu(h(y))$, where $\sigma_\mu$ and $\sigma_{\mu'}$ are substitutions
associated respectively with $\mu$ and $\mu'$. In what follows, we
will write $\mu' \geq \mu$ to indicate that $\mu'$ is more general than $\mu$.

Proposition 1 Atomic CQ entailment over msa+trans KBs is
undecidable.

Proof: 

The proof is by reduction from atomic CQ entailment with
general existential rules. Let $\mathcal{F}$ be a set of facts, $\mathcal{R}$ be a set of
rules, and $Q$ be an atomic CQ.

First we consider a new transitive predicate p, which is the 
only transitive predicate we use. 

We next rewrite $\mathcal{F}$ into $\mathcal{F}'$ as follows. For each term
$t \in terms(\mathcal{F})$, we add the atoms $p(t, a_t)$ and $p(a_t, t)$ to $\mathcal{F}'$,
where $a_t$ is a fresh constant.

Then we rewrite $\mathcal{R}$ into an msa set of rules $\mathcal{R}'$. For each
rule $R = B \rightarrow H$, we consider the rule $R' = B' \rightarrow H'$
obtained as follows. Its body $B'$ is composed of the atoms of
$B$ as well as the atoms $p(t, t)$ for each term $t \in terms(B)$.
Its head $H'$ contains the atoms of $H$ as well as two atoms
$p(z, x_z)$ and $p(x_z, z)$, where $x_z$ is a fresh variable, for each
existential variable $z$ in $H$. It can be checked that $\mathcal{R}'$ indeed
satisfies the MSA property.

Now, let $R_p$ be the rule expressing the transitivity of $p$. It is
clear that $(\mathcal{F}, \mathcal{R}) \models Q$ if and only if $(\mathcal{F}', \mathcal{R}' \cup \{R_p\}) \models Q$.

We conclude that atomic conjunctive query entailment over
msa+trans knowledge bases is undecidable.

□ 
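
The syntactic transformation used in this reduction is simple enough to sketch in Python. The encoding below is an assumption made for illustration only: atoms are tuples `(predicate, term, ...)`, a rule is a hypothetical triple `(body_atoms, head_atoms, existential_vars)`, fresh names are built with the prefixes `a_` and `x_` (assumed not to clash with existing terms), and the function name `encode` is ours.

```python
def encode(facts, rules, p="p_trans"):
    """Build the fact base F', the rule set R' and the transitivity rule for
    the fresh transitive predicate p, following the construction in the proof."""
    # F': keep the facts, and link every term t to a fresh constant a_t
    terms = {t for atom in facts for t in atom[1:]}
    new_facts = set(facts)
    for t in terms:
        a_t = f"a_{t}"  # fresh constant, assumed unused elsewhere
        new_facts |= {(p, t, a_t), (p, a_t, t)}

    new_rules = []
    for body, head, exist in rules:
        # B': the atoms of B plus p(t, t) for every term t of B
        body_terms = sorted({t for atom in body for t in atom[1:]})
        new_body = list(body) + [(p, t, t) for t in body_terms]
        # H': the atoms of H plus p(z, x_z), p(x_z, z) per existential z
        new_head = list(head)
        new_exist = list(exist)
        for z in exist:
            x_z = f"x_{z}"  # fresh existential variable
            new_head += [(p, z, x_z), (p, x_z, z)]
            new_exist.append(x_z)
        new_rules.append((new_body, new_head, new_exist))

    transitivity = ([(p, "x", "y"), (p, "y", "z")], [(p, "x", "z")], [])
    return new_facts, new_rules, transitivity


# Tiny example: one fact r(a, b) and one rule r(x, y) -> exists z. r(y, z).
FACTS = {("r", "a", "b")}
RULES = [([("r", "x", "y")], [("r", "y", "z")], ["z"])]
NEW_FACTS, NEW_RULES, TRANS = encode(FACTS, RULES)
```

With transitivity, the two added atoms $p(t, a_t)$ and $p(a_t, t)$ yield $p(t, t)$ for every term, which is what the extra body atoms of the rewritten rules rely on.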


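Proposition 3 Let $\mathcal{F}$ be a fact base and $(\mathbb{Q}, \mathbb{P})$ be a UPCQ.
Then $\mathcal{F}, \Pi_{\mathbb{P}} \models Q_{\mathbb{Q}}$ iff $\mathcal{F} \models Q$ for some $Q \in full(\mathbb{Q}, \mathbb{P})$.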

Proof: We successively prove the two directions. 

($\Rightarrow$) Let $T$ be an instantiation of $(\mathbb{Q}, \mathbb{P})$ such that there
exists a homomorphism $\pi$ from its associated full instance to the
fact base $\mathcal{F}$. Let us consider a node of $T$ labelled by a
standard pattern atom $P[t_1, t_2]$. The label $r(\rho(\vec{t}))$ of its child was
obtained by choosing an atom $r(\vec{t})$ in the pattern definition of
$P$. Thus our Datalog program $\Pi_{\mathbb{P}}$ contains the rule $r(\vec{t}) \rightarrow
p^+(\#1, \#2)$ where $\#1, \#2 \in \vec{t}$. Applying the rule according
to the homomorphism $\pi \circ \rho$, we can add the atom $p^+(t_1, t_2)$ to
$\mathcal{F}$. Let us repeat this procedure for every node of $T$ labelled
by a standard pattern atom. Consider next a repeatable pattern
atom labelled $P^+[t,t']$ whose children are respectively
labelled $P[t = t_1, t_2], P[t_2, t_3], \ldots, P[t_{k-1}, t_k = t']$. According
to the rule applications already described, $\mathcal{F}$ now contains
the atoms $p^+(t = t_1, t_2), p^+(t_2, t_3), \ldots, p^+(t_{k-1}, t_k = t')$.
Then, by successive applications of the rule in $\Pi_{\mathbb{P}}$ expressing
the transitivity of $p^+$, we finally add to $\mathcal{F}$ the atom $p^+(t, t')$.
Repeat this procedure for every node of $T$ labelled by a
repeatable pattern atom. Now the root of $T$ is labelled by some
$\mathcal{Q} \in \mathbb{Q}$. The UCQ $Q_{\mathbb{Q}}$ contains a CQ $Q$ that was obtained
from $\mathcal{Q}$ by replacing each repeatable pattern $P^+[t_1, t_2]$ by
$p^+(t_1, t_2)$. Observe that the restriction of $\pi$ to the terms of $Q$
is a homomorphism from $Q$ to the fact base obtained from
$\mathcal{F}$ by previous rule applications.

($\Leftarrow$) Conversely, let us consider a fact base $\mathcal{F}'$ obtained
by saturating the initial fact base $\mathcal{F}$ with the rules of $\Pi_{\mathbb{P}}$,
and a homomorphism $\pi$ from some CQ $Q'$ in the UCQ
$Q_{\mathbb{Q}}$ to $\mathcal{F}'$. Let us now build an instantiation $T$ whose full
instance can be mapped to $\mathcal{F}$ thanks to a homomorphism
$\pi'$. The root node of $T$ is labelled by the PCQ $\mathcal{Q}$ in $\mathbb{Q}$
from which $Q'$ was obtained. Its children are labelled by
the atoms and pattern atoms of $\mathcal{Q}$. Now we define the
restriction of $\pi'$ to the terms of $\mathcal{Q}$ as $\pi$. Let us now
consider a child of the root labelled by a repeatable pattern atom
$P^+[t,t']$. It follows that $p^+(\pi(t), \pi(t'))$ is an atom of $\mathcal{F}'$.
Since this atom is not in the initial fact base, it means that it
has been obtained by a (possibly empty) sequence of
applications of the rule expressing the transitivity of $p^+$ on a $p^+$-path
$\pi(t) = t_1, \ldots, t_k = \pi(t')$ such that no atom $p^+(t_i, t_{i+1})$ in
$\mathcal{F}'$ has been obtained by a transitivity rule. Then the node
labelled $P^+[t, t']$ has $k - 1$ children respectively labelled
$P[t = x_1, x_2], \ldots, P[x_{k-1}, x_k = t']$. For the fresh
variables $x_2, \ldots, x_{k-1}$, we define $\pi'(x_i) = t_i$. Repeat this
procedure for every repeatable pattern atom in $T$. Let us next
consider a node of $T$ labelled by a standard pattern atom
$P[x, x']$. Since that node was obtained in the previous phase,
we know that the atom $p^+(\pi'(x), \pi'(x'))$ is in $\mathcal{F}'$, and that
it was not obtained from the application of a transitivity rule.
Thus, the Datalog rule used to produce that atom is
necessarily a rule obtained from the definition of the pattern $P$.
Let $r(\vec{t}) \rightarrow p^+(\#1, \#2)$, where $\#1, \#2 \in \vec{t}$, be that rule.
According to that pattern definition, we can add to the node
labelled $P[x, x']$ a child labelled $r(\rho(\vec{t}))$. Since the Datalog
rule was applied according to a homomorphism $\pi''$, we
define, for every fresh variable $\rho(t)$, $\pi'(\rho(t)) = \pi''(t)$. Do the
same for every standard pattern atom of $T$. The instance
associated with $T$ is full, and $\pi'$ is a homomorphism from this
full instance to the initial fact base. □
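
The role of the Datalog program $\Pi_{\mathbb{P}}$ in this proof can be illustrated with a small Python sketch: each atom of a pattern definition gives a rule deriving $p^+(\#1, \#2)$, and the transitivity rule for $p^+$ is then applied until a fixpoint is reached. This is a naive illustration under our own assumptions, not the authors' implementation: a pattern definition is a list of atoms given as tuples whose terms are all variables, with the distinguished variables written as the strings `"#1"` and `"#2"`; the function names `match` and `p_plus` are hypothetical.

```python
def match(atom, fact):
    """Match a pattern-definition atom (all terms are variables) against a
    fact; return the variable binding, or None if they do not match."""
    if atom[0] != fact[0] or len(atom) != len(fact):
        return None
    binding = {}
    for var, value in zip(atom[1:], fact[1:]):
        if binding.setdefault(var, value) != value:
            return None
    return binding


def p_plus(definition, facts):
    """All pairs (a, b) such that p+(a, b) is derived from `facts` by the
    rules r(t) -> p+(#1, #2) of the definition plus transitivity of p+."""
    edges = set()
    for atom in definition:
        for fact in facts:
            b = match(atom, fact)
            if b is not None and "#1" in b and "#2" in b:
                edges.add((b["#1"], b["#2"]))
    closure = set(edges)
    while True:  # naive transitive closure
        new = {(a, d) for a, b in closure for c, d in closure if b == c}
        new -= closure
        if not new:
            return closure
        closure |= new


# Pattern definition containing p(#1, #2) and s(#2, y, #1).
DEFINITION = [("p", "#1", "#2"), ("s", "#2", "y", "#1")]
FACTS = {("p", "a", "b"), ("s", "c", "d", "b")}
PAIRS = p_plus(DEFINITION, FACTS)
```

In this example the fact $s(c, d, b)$ contributes the pair $(b, c)$, and transitivity then derives $(a, c)$, mirroring how a repeatable pattern is expanded into a sequence of standard patterns.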

Proposition 4 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ and $R \in \mathcal{R}_L^+$. For every
instance $Q$ of $(\mathcal{Q}, \mathbb{P})$ and unifier $\mu$ of $Q$ with $R$, there exist an
instance of interest $Q'$ of $(\mathcal{Q}, \mathbb{P})$ w.r.t. $R$ and a unifier $\mu'$ of
$Q'$ with $R$ such that $\mu'$ is more general than $\mu$.

Proof: Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ, $R \in \mathcal{R}_L^+$ be a rule with head
$p(\vec{t})$, $Q$ be an instance of $(\mathcal{Q}, \mathbb{P})$, and $\mu = (Q_2, p(\vec{t}), P_u)$ be
a unifier of $Q$ with $R$.

Consider a repeatable pattern $P^+[t_1, t_2]$ from which some
atoms in $Q_2$ are expanded. If $P^+[t_1, t_2]$ is expanded into
$k \leq arity(p) + 2$ standard patterns in $Q$, then there is
an instance of interest $Q'$ that expands $P^+[t_1, t_2]$ into
exactly $k$ standard patterns. Thus, there is an isomorphism
$\pi$ between the atoms expanded under $P^+[t_1, t_2]$ in $Q$ and
in $Q'$. Assume instead that $P^+[t_1, t_2]$ is expanded in $Q$
into $k > arity(p) + 2$ standard patterns. We denote
by $\sigma$ a substitution associated with $P_u$, and by $P[t_1 =
x_0, x_1], P[x_1, x_2], \ldots, P[x_{k-2}, x_{k-1}], P[x_{k-1}, x_k = t_2]$ the
sequence of standard patterns expanded from $P^+[t_1, t_2]$ in $Q$,
and let $x_s$ and $x_e$ ($s < e$) be the external terms of $P^+[t_1, t_2]$
w.r.t. $\mu$. The unifier is single-piece (cf. Section 2), thus, for
every $0 < i < k$, $\sigma(x_i) = \sigma(z_i)$ for some existential variable
$z_i$ from the head of $R$.

We construct an instance $Q'$ and function $\pi$ as follows.
Starting from $Q$, we expand every repeatable pattern
$P^+[t_1, t_2]$ that is relevant for $\mu$ into $e - s$ standard patterns
(where $e$ and $s$ are defined as above and depend on the
particular pattern):

$P[t_1 = x'_s, x'_{s+1}], P[x'_{s+1}, x'_{s+2}], \ldots, P[x'_{e-1}, x'_e = t_2]$.

Then for every $s < i < e$, we set $\pi(x'_i) = x_i$, and we expand
$P[x'_i, x'_{i+1}]$ into the atom $a'_i$ that is obtained from the atom
$a_i$ expanded under $P[x_i, x_{i+1}]$ in $Q$ by replacing every $x_j$
with $x'_j$. If $e - s \leq arity(p) + 2$, we are done. Otherwise,
we will need to remove some patterns in order to satisfy the
definition of instances of interest. To this end, we define a
sequence $s < i_1 < j_1 < \cdots < i_m < j_m < e$ of indices as
follows:

• We call $i < j$, with $s < i < j < e$, a matching pair if
$x'_{i+1}$ in $a'_i$ and $x'_j$ in $a'_j$ occur at the same position of
$p$ (hence, $\sigma(x_{i+1}) = \sigma(x_j)$);

• We say that a matching pair $i < j$ is maximal w.r.t. index
$\ell$ if the following conditions hold:

- $i \geq \ell$,

- there is no matching pair $i' < j'$ with $\ell \leq i' < i$,

- there is no matching pair $i < j'$ with $j' > j$.

• We let $i_1 < j_1$ be the matching pair that is maximal w.r.t.
index $s + 1$.

• If $i_k < j_k$ is already defined, then we let $i_{k+1} < j_{k+1}$
be the matching pair that is maximal w.r.t. index $j_k$, if
such a pair exists (otherwise, $i_k < j_k$ is the final pair in
the sequence).

Now remove from $Q'$ all of the patterns $P[x'_\ell, x'_{\ell+1}]$ such that
$i_g < \ell < j_g$ for some $1 \leq g \leq m$, as well as the atoms that
are expanded from such patterns. We claim that there are now
at most $arity(p) + 2$ patterns $P[x'_\ell, x'_{\ell+1}]$ below $P^+[t_1, t_2]$ in
$Q'$. Indeed, if this were not the case, we could find a matching
pair $i < j$ among the remaining patterns. Since $i_1 < j_1$ is
maximal w.r.t. index $s + 1$, and there are no further matching
pairs starting from $j_m$, we know that $i > i_1$ and $i < j_m$.
Moreover, since $P[x'_i, x'_{i+1}]$ is still present in $Q'$, it must be the case
that $j_g < i < i_{g+1}$ for some $1 \leq g < m$. But this contradicts
the fact that $i_{g+1} < j_{g+1}$ is maximal w.r.t. $j_g$.

"^Strictly speaking, we mean the instantiation underlying Q, but 
to simplify the notation, here and later in the appendix, we will often 
refer to instances, leaving the instantiation implicit. 



In order for the different remaining patterns to form a
sequence, we will need to perform a renaming of terms. If there
are $n$ patterns left under $P^+[t_1, t_2]$, then we will rename these
patterns from left to right by:

$P[t_1 = x''_0, x''_1], P[x''_1, x''_2], \ldots, P[x''_{n-1}, x''_n = t_2]$

and will rename the atoms underneath these patterns
accordingly. We will also update $\pi$ by setting $\pi(x''_i) = \pi(x'_j)$ if $x'_j$
was renamed into $x''_i$ and there is no $x'_{j'}$ with $j' < j$ that was
also renamed into $x''_i$.

Let Q' be the instance obtained in this manner. We note 
that by construction, it is an instance of interest of (Q,P) 
w.r.t. R, as we only expand patterns into atoms that use the 
predicate p from the rule head, and the number of patterns 
generated from any repeatable pattern is at most arity{p) + 2. 

Regarding $\pi$, note that a term may be shared among several
repeatable patterns that are relevant for $\mu$. However, we
can show that if a term is shared by multiple relevant patterns,
then the (partial) mapping associated with those patterns will
agree on the shared term, i.e. $\pi$ is well defined. First note that if
a term is shared by two repeatable patterns, then it must
appear as one of the distinguished terms ($t_1$, $t_2$) in both patterns.
Moreover, by tracing the above construction, we find that $\pi$
is the identity on such terms.

To complete the definition of $\pi$, we extend it to all of the
terms of $Q'$ by letting $\pi$ be the identity on all terms that do not
occur underneath a developed repeatable pattern (i.e., terms
that appear in a repeatable pattern that is not expanded, or in
one of the standard atoms of $\mathcal{Q}$). Observe that $\pi$ is an injective
function, so its inverse $\pi^{-1}$ is well-defined.

Now let $Q'_2$ consist of all atoms in $Q' \cap Q_2$ that are not
expanded from a repeatable pattern (i.e., they are standard
atoms from $\mathcal{Q}$) as well as all atoms in $Q'$ that lie under a
repeatable pattern.

Note that by construction every term $t$ in $Q'_2$ is such that
$\pi(t)$ appears in $Q_2$. We can thus define a partition $P'_u$ of the
terms in $Q'_2 \cup H$ by taking every class $C$ in $P_u$ and replacing
every term $t$ from $Q_2$ by $\pi^{-1}(t)$ if such a term exists,
and otherwise deleting $t$; terms from $p(\vec{t})$ are left untouched.
Moreover, by the injectivity of $\pi$, every term appears in at
most one class, i.e., $P'_u$ is indeed a partition.

We aim to show that $\mu' = (Q'_2, p(\vec{t}), P'_u)$ is the desired
unifier. We first show that $\mu'$ is a unifier of $Q'$ with $R$. In what
follows, it will prove convenient to extend $\pi$ to the terms in
the head atom $p(\vec{t})$, by letting $\pi$ be the identity on such terms.
We will let $\sigma$ be a substitution associated with $\mu$, and let $\sigma'$
be the corresponding substitution for $\mu'$ defined by setting

$\sigma'(t) = \sigma(\pi(t))$.

• P,^ is admissible: since tt is the identity on constants, if 
a class in P^ contains two constants c, d, then the corre¬ 
sponding class in P„ must also contain c, d (a contradic¬ 
tion). 

• (T'{p{i)) = a'{Q 2 ): since a'{p(i)) = a{p{i)) (due to 
our choice of a'), it suffices to prove that cr'(Q 2 ) ^ 
a{Q 2 )- First take some atom a that belongs to Q 2 Qa- 
Then we have 7r(Q;) = a, so cr'(Q;) = a{7r{a)) G 
a{Q 2 )- Next consider the case of an atom a that 


belongs to <32 but not Q 2 - Then a must lie be¬ 
low a repeatable pattern P'^[ti,t 2 ] that is expanded 
into k > arity{p) + 2 standard patterns P[fi = 
Xo, Xi], P[xi, X2], . . . , P[Xfe_2, Xfe_i], P[xfc-1, Xfe = 
12 ] in Q. In this case, P'''[fi, ^2] is expanded in Q' into 

P[ti = x'^, x'^+J, P[x;,+i, x;,+ 2 ]> • ■ •. ^[<- 1 . < = h], 

and each P[x',x'_|_]^] is expanded into a'. If e — s < 
arity{p) + 2 , then the atoms o' all belong to Qi- if 
we have a = o', then we have 7 r(o') = Oi, hence 
a'(a) = (T( 7 r(o')) = cr(oi) G cr{Q 2 )- The final pos¬ 
sibility is that e — s > arity{p) + 2, in which case 
some of the patterns will be removed and the remaining 
patterns will be renamed (as will be their corresponding 
atoms). Suppose that a is the atom a'^ below the pattern 
P[x", x"_|_]^], which was obtained from renaming the pat¬ 
tern P[x/i, x?j+i]. We claim that cr'(Q;) = a{n{a)) = 
a{ah), hence a'{a) G (j{Q 2 )- By examining the way 
renaming is performed, there are two situations that can 
occur: 

- 7 r(x") = Xh and 7 r(x"^_^) = Xh+i'- in this case, 
7 r(a'^) = Gfi, hence cr'(a) = a{ah)- 

- 7r(x") 7 ^ Xfi- in this case, there must exist a match¬ 
ing pair ig < jg such that h = jg, n{x'[) = x^^+i, 
and 7 r(x"_i_]^) = Xh+i- From the definition of 
matching pairs, we know that a{xi^+i) = cr{xj^). 
It follows that cr'(x") = (T( 7 r(x")) = a{xi^+i) = 
a{xh) and <t'(x"^J = (T( 7 r(x"^J) = a{xh+i). 
We can thus conclude that a'{a) = a{ah)- 

• for a contradiction, suppose the class C in P^ contains 
an existential variable z from H and either a constant or 
a variable that occurs in Q' \Q 2 - If it contains a con¬ 
stant c, then the corresponding class C in P„ will con¬ 
tain both z and c, i.e., C is not a valid class. Next sup¬ 
pose that C" contains a variable x that occurs in Q' \ Q 2 , 
which means that the corresponding class C in P„ con¬ 
tains 7 r(x). Since x that occurs in Q' \ Q' 2 , it must either 
appear in a standard atom of Q' that does not appear un¬ 
der any repeatable pattern or in a repeatable pattern that 
is not developed in Q'. In the former case, the same 
atom appears in Q \ Q 2 , and in the latter case, since Q 
is full, there is an atom in Q that is developed from the 
repeatable pattern and contains 7 r(x), but which does not 
participate in Q 2 . In both cases, we obtain a contradic¬ 
tion. 

Finally, we show that p' is more general than p: 

• <j'{Q' 2 )) Q o'{Q 2 )'- proven above. 

• if a'{ui) = a'{u 2 ), then ui,U 2 belong to the same class 
in P^, and so 7 r(Mi) and 71(^2) must belong to the same 
class in P„. 

We have thus shown that Q' is an instance of interest of 
(Q,P) w.r.t. R such that there is a unifier p' of Q' with R 
with p' > p. □ 

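Proposition 5 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ where $P^+[t_1, t_2]$ occurs
and $R \in \mathcal{R}_L^+$. For any instance $Q$ of $(\mathcal{Q}, \mathbb{P})$, any classical
direct rewriting $\mathcal{Q}'$ of $Q$ with $R$ w.r.t. a unifier internal to
$P^+[t_1, t_2]$, and any $Q' \in full(\mathcal{Q}', \mathbb{P})$, there exists a direct
rewriting $\mathbb{P}'$ of $\mathbb{P}$ w.r.t. $P$ and $R$ such that $(\mathcal{Q}, \mathbb{P}')$ has a full
instance that is isomorphic to $Q'$.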

Proof: Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ where $P^+[t_1, t_2]$ occurs, $R =
(B \rightarrow H) \in \mathcal{R}_L^+$, $Q$ be an instance of $(\mathcal{Q}, \mathbb{P})$, $\mu =
(Q_2, H, P_u)$ be a unifier internal to $P^+[t_1, t_2]$ of $Q$ with $R$,
$\mathcal{Q}'$ be the classical direct rewriting of $Q$ with $R$ w.r.t. $\mu$, and
$Q'$ be a full instance of $(\mathcal{Q}', \mathbb{P})$.

Since $\mu$ is internal to $P^+[t_1, t_2]$, all atoms in $Q_2$ are
expanded from $P^+[t_1, t_2]$ in $Q$, and do not unify $t_1$ with $t_2$, nor
$t_1$ (resp. $t_2$) with an existential variable from $H$. We denote
by $P[t_1 = x_0, x_1], P[x_1, x_2], \ldots, P[x_{k-1}, x_k = t_2]$ the
sequence of standard patterns expanded under $P^+[t_1, t_2]$ in $Q$,
$x_s$ and $x_e$ ($s < e$) the external terms of $P^+[t_1, t_2]$ w.r.t. $\mu$,
and $a_i$ the atom expanded under $P[x_i, x_{i+1}]$. From Prop. 4
there is a unifier $\mu'$ of an instance of interest $Q_3$ of $\mathcal{Q}$ with $R$
with $\mu' \geq \mu$. Since $x_s$ and $x_e$ are not unified with existential
variables, let $Q_4$ be the CQ obtained from $Q_3$ by removing
all atoms and patterns that are not relevant for $\mu'$. Obviously
$Q_4$ is an instance of interest of a PCQ of form $P^+[t_1, t_2]$. Let
$\mathbb{P}'$ be the direct rewriting of $\mathbb{P}$ w.r.t. $\mu'$, obtained from $Q_4$.

Let $A_l = \{P[x_i, x_{i+1}] \mid 0 \leq i < s\}$, $A_m =
\{P[x_i, x_{i+1}] \mid s \leq i < e\}$, $A_r = \{P[x_i, x_{i+1}] \mid e \leq i < k\}$,
and $A = A_l \cup A_m \cup A_r$. Further let $A'_l$ (resp. $A'_m$, $A'_r$, $A'$) be
the set of atoms expanded under $A_l$ (resp. $A_m$, $A_r$, $A$) in $Q$.

Initialize Q" to Q \ A' U {P+[<i,< 2 ]}- One can see that 
Q" is an instance of both (Q,P) and (Q,P'). If P is a 
not a repeatable pattern, let t = \, otherwise let S be the 
repeatable pattern in P, and S\x'Q,x'f\,..., S[x'f^_.^,x'(\ be 
the sequence expanded from S'+[xq,x^] in Q'. We denote 
by a' the atom expanded under x'_|_]^]. Then expand 
P^[ti,'t 2 ] in Q" into k' = |A/| -f \Ar \ + i standard patterns: 
P[ti = Xq,x'{], ..., P[x'^_i,x'^, = ^ 2 ]- Let TT be the func¬ 
tion defined as follows: 

• for all 0 < i < s, 7r(x") = xp 

• for all s < i < s -I- f, 7r(x") = x[_p 

• for all s -I- £ < i < k', 7r(x") = Xi_^_|_(e_s). 

Note that tt is injective, so its inverse exists. Expand all 
P[x",x"+i] with 0 <i<s or s + i<i<k (resp. 
s < i < s + t) into 7r“^(ai) (resp. 7r“^(a')). Finally, for 
all terms u in Q" for which tt is not defined (i.e., those terms 
appearing in atoms that were not expanded from the pattern 
P^[ti,t 2 ]), we set tt{u) = u. 

By construction, Q" is still an instance of (Q, P') and tt is 
an isomorphism between Q' and Q". □ 

Proposition 6 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ and $R \in \mathcal{R}_L^+$. For every
$Q \in full(\mathcal{Q}, \mathbb{P})$ and every classical direct rewriting $Q'$ of $Q$
with $R$ w.r.t. an external unifier, there is a direct rewriting $\mathcal{Q}'$
of $\mathcal{Q}$ w.r.t. $\mathbb{P}$ and $R$ that has an instance isomorphic to $Q'$.

Proof: Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ, $R = (B \rightarrow H) \in \mathcal{R}_L^+$,
$Q \in full(\mathcal{Q}, \mathbb{P})$, $\mu = (Q_u, H, P_u)$ be an external unifier of
$Q$ with $R$, and $Q'$ be the classical direct rewriting of $Q$ with
$R$ w.r.t. $\mu$.

From Proposition 4 there is an instance of interest $Q_2$ of
$(\mathcal{Q}, \mathbb{P})$ such that there is a unifier $\mu' = (Q_{u'}, H, P_{u'}) \geq \mu$ of
$Q_2$ with $R$. We denote by $\sigma$ (resp. $\sigma'$) a substitution
associated with $\mu$ (resp. $\mu'$).

For any repeatable pattern $P^+[t_1, t_2]$ in $\mathcal{Q}$, build $A$, $A_l$, $A_m$
and $A_r$ as in the proof of Proposition 5 using the instance $Q_2$
and unifier $\mu'$. Assume $t_1$ (or $t_2$) is unified with an existential
variable, then from the condition on external unifiers, either
$A_l$ or $A_r$ is empty. Consider the minimally-unifiable instance
$Q_m$ of $\mathcal{Q}$ w.r.t. $\mu'$ that replaces $P^+[t_1, t_2]$ by: (i) $A_m$ if $A_l =
A_r = \emptyset$; (ii) $P^+[t_1, x_s], A_m$ if $A_r = \emptyset$ and $A_l \neq \emptyset$; or (iii)
$A_m, P^+[x_e, t_2]$ if $A_l = \emptyset$ and $A_r \neq \emptyset$. In case (ii) (resp.
(iii)), since all atoms in $A_l$ (resp. $A_r$) are not involved in $\mu'$,
$x_s$ (resp. $x_e$) is not unified with an existential variable (or the
piece condition on unifiers would not be satisfied). Therefore,
$\mu'$ is a unifier of $Q_m$ with $R$. We let $\mathcal{Q}'$ be the direct rewriting
of $Q_m$ w.r.t. $\mu'$ and $R$.

Note that each repeatable pattern P+[fi, £ 2 ] in Q' expands 
into Ai A cr(P) A Ar, and in Q' there is a P+[ii,x^] (resp. 
P+[xe, £ 2 ]) iff A; (resp. Ar) is not empty. Thus consider Q" 
obtained from Q' by expanding P+ , Xg] (resp. P+ [xe, £ 2 ]) 
into k standard patterns where k = \ Ai \ (resp. k = |Ar|), and 
choose the same atoms as in AJ (resp. AJ.). Since p' > P, 
there is an homomorphism tt from a'{Qu' ) to a{Qu)- Note 
that if we restrict tt to terms in <j'{B), tt is an isomorphism. 
Furthermore, we can extend tt to Q" in the same way as we 
did in the previous proof. Thus Q" is isomorphic to Q'. □ 

Proposition 7 Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ and $R \in \mathcal{R}_L^+$. If $\mathcal{Q}'$ is a
non-excluded direct rewriting of $\mathcal{Q}$ with $R$, then $|\mathcal{Q}'| \leq |\mathcal{Q}|$.

Proof: Let $(\mathcal{Q}, \mathbb{P})$ be a PCQ, $R = (B \rightarrow H) \in \mathcal{R}_L^+$, $Q$ be
a non-excluded minimally-unifiable instance of $(\mathcal{Q}, \mathbb{P})$, $\mu =
(Q', H, P_u)$ be an external unifier of $Q$ with $R$, and $\sigma$ the
substitution induced by $P_u$.

Note that all repeatable patterns $P^+[t_1, t_2]$ are at most
replaced by the sequence $S$ needed by the unifier (i.e., $S \subseteq Q'$),
plus a single repeatable pattern $P^+[t_1, x_1]$ (or $P^+[x_k, t_2]$).
Indeed, the only situation that would lead us to introduce
more than one more repeatable pattern (i.e., as in External
Rewriting case (i)) is when either $t_1$ or $t_2$ is unified with an
existential variable. However, if $t_1$ (or $t_2$) is unified with an
existential variable, because of the piece condition on
unifiers, no unifier of $P^+[t_1, x_1] \wedge S \wedge P^+[x_k, t_2]$ can be found.

Since $|B| = 1$, we have to show that all atoms that were
introduced when replacing a repeatable pattern are erased by
the direct rewriting of $Q$ w.r.t. $\mu$.

If $Q'$ consists of at least one atom that is not expanded from
a pattern, the direct rewriting of $Q$ w.r.t. $\mu$ erases this atom.

Next assume $Q'$ consists only of atoms expanded from
repeatable patterns. If $Q' = \{P^+[t_1, t_2]\}$ and neither $t_1$
nor $t_2$ is unified with an existential variable, then $\sigma(t_1) =
\sigma(t_2)$, so the only non-excluded minimally-unifiable instance
of $\mathcal{Q}$ w.r.t. $\mu$ replaces $P^+[t_1, t_2]$ only by the sequence $S$
needed by the unifier (see the first condition on non-excluded
minimally-unifiable instances). Thus, the direct rewriting
erases $P^+[t_1, t_2]$.

Otherwise, we know that at least one $P^+[t_1, t_2]$ from $\mathcal{Q}$
is replaced by the sequence $S$ involved in the unifier (see
the second condition on non-excluded minimally-unifiable
instances), thus there is at least one $P^+[t_1, t_2]$ erased by the
direct rewriting. □


We will break the proof of Theorem 2 into the following
five lemmas.

Lemma 1 Let $Q$ be a CQ, $R \in \mathcal{R}_T$, $\mathbb{P}_0$ be the initial set of
pattern definitions relative to $\mathcal{R}_T$ (see Step 1 of the algorithm
overview), and $Q^+$ be obtained from $Q$ by replacing all atoms
$p(t_1, t_2)$ such that $p$ is a transitive predicate by $P^+[t_1, t_2]$. If
there is a classical direct rewriting $Q'$ of $Q$ with $R$, then there
is a full instance $Q''$ of $(Q^+, \mathbb{P}_0)$ that is isomorphic to $Q'$.

Proof: Let $p(t_1, t_2)$ be the atom of $Q$ that is rewritten to
obtain $Q'$. Since $p$ is a transitive predicate, it occurs in a pattern
definition in $\mathbb{P}_0$, and $Q^+$ contains the atom $P^+[t_1, t_2]$. In
$Q'$, $p(t_1, t_2)$ is rewritten into $p(t_1, x_1) \wedge p(x_1, t_2)$. Let $Q''$
be the full instance of $(Q^+, \mathbb{P}_0)$ that expands all repeatable
patterns but $P^+[t_1, t_2]$ into a single standard pattern, expands
$P^+[t_1, t_2]$ into two standard patterns $P[t_1, x'_1]$, $P[x'_1, t_2]$, and
then further expands the standard patterns using the unique
atom in each of the pattern definitions. It is clear that $Q''$ is
isomorphic to $Q'$ (simply map $x'_1$ to $x_1$ and all other terms to
themselves). □

Lemma 2 Let $\mathbb{P}$ be a set of pattern definitions, $\mathbb{P}_0 \subseteq \mathbb{P}$ be
the initial set of pattern definitions built from the set $\mathcal{R}_T$
of transitivity rules, $(\mathcal{Q}, \mathbb{P})$ be a PCQ that does not contain
any standard atom using a transitive predicate, $Q$ be a full
instance of $(\mathcal{Q}, \mathbb{P})$, and $Q^+$ be obtained from $Q$ by replacing
all atoms $p(t_1, t_2)$ with $p$ transitive by $P^+[t_1, t_2]$.

Then, for every full instance $Q'$ of $(Q^+, \mathbb{P}_0)$, there is a full
instance $Q''$ of $(\mathcal{Q}, \mathbb{P})$ such that $Q''$ is isomorphic to $Q'$.

Proof: We build the instance $Q''$ as follows. Initialize $Q''$ to
the atoms and repeatable patterns occurring in $\mathcal{Q}$. Next, for all
repeatable patterns $P_i^+[t_1, t_2]$ in the instantiation underlying
$Q$ consider each of the atoms that is expanded from a child of
$P_i^+[t_1, t_2]$ in turn, working from left to right. If the atom $p(\vec{t})$
under $P_i[u, v]$ is being considered, then do the following:

• if $p$ is not a transitive predicate, then add a single child
$P_i[u, v]$ to $P_i^+[t_1, t_2]$, and expand it into $p(\vec{t})$.

• if $p$ is transitive, then $p(\vec{t})$ has been replaced in $Q^+$
by $P^+[\vec{t}]$. We also know that $\vec{t}$ consists of the
terms $u, v$ from $P_i[u, v]$. We suppose that $p(\vec{t}) =
p(u, v)$ (hence $P^+[\vec{t}] = P^+[u, v]$); a similar argument
can be used if the positions are reversed. Let
$P[u = x_0, x_1], \ldots, P[x_{k-1}, x_k = v]$ be the children
of $P^+[u, v]$ in $Q'$, and $a_\ell$ be the atom expanded under
$P[x_\ell, x_{\ell+1}]$ ($0 \leq \ell < k$). In place of the child
$P_i[u, v]$ in $Q^+$, we will add $k$ children to $P_i^+[t_1, t_2]$ in
$Q''$: $P_i[u = x_0, x_1], \ldots, P_i[x_{k-1}, x_k = v]$, and expand
$P_i[x_j, x_{j+1}]$ into $a_j$. Note that we may assume that the
terms $x_i$ ($0 < i < k$) are fresh, i.e., they do not already
appear in $Q''$.

It can be verified that the resulting full instance $Q''$ is
isomorphic to $Q'$. Indeed, all atoms in $Q'$ that are also in $Q$ are
present in $Q''$. All other atoms belong to a sequence of
transitive atoms, which we have reproduced (modulo renaming of
variables) in $Q''$. □

Lemma 3 Let $Q$ be a CQ and $\mathcal{R}$ be a set of linear+trans
rules, and let $(\mathbb{Q}, \mathbb{P})$ be the output of the algorithm. For any
$Q'$ obtained from a sequence of classical direct rewritings of
$Q$ with $\mathcal{R}$, there is a PCQ $(\mathcal{Q}, \mathbb{P})$ with $\mathcal{Q} \in \mathbb{Q}$ and a full
instance $Q''$ of $(\mathcal{Q}, \mathbb{P})$ s.t. $Q''$ is isomorphic to $Q'$.

Proof: Let $Q = Q_0, \mu_1, Q_1, \mu_2, Q_2, \ldots, \mu_k, Q_k = Q'$ be a
sequence of classical direct rewritings from $Q$ to $Q'$, and let
$R_1, \ldots, R_k$ be the associated sequence of rules from $\mathcal{R}$.

We show the desired property by induction on $0 \leq i \leq k$.
For the base case ($i = 0$), we can set $\mathcal{Q}_0 = Q^+$, since $Q_0 =
Q$ is clearly a full instance of $(\mathcal{Q}_0, \mathbb{P})$.

For the induction step, suppose that we have Qi_i G Q 
and a full instance Q'l_i of (Qi_i,P) that is isomorphic to 
the CQ Qi-i. There are two cases to consider, depending on 
the type of the rule Ri. 

If Ri is a transitivity rule, then from Lemma [T] we know 
that Qf_i (obtained from Qi-i by replacing every transitive 
predicate p by pattern P+) is such that there is a full instance 
Qf_i of (Q^Li, P) that is isomorphic to Qi. Furthermore, we 
know that Qi-i cannot contain any standard atoms with tran¬ 
sitive predicates, since every PCQ produced in Step 5 con¬ 
tains patterns for the transitive predicates. Thus, we may ap¬ 
ply Lemma |2] and infer that Qf_i is isomorphic to some full 
instance Qi_i of (Qi_i,P). Therefore, Qi_i is isomorphic 
to Qi. 

If Ri is not a transitive rule, since Qi-i is isomorphic to 
some full instance Qi_i of (Qi_i,P), let p" be the unifier 
of Qi-i with Ri obtained from p and the isomorphism be¬ 
tween Qi-i and Qi_i. If p'l is internal to some repeatable 
pattern, then from Proposition|5] we know that there is an in¬ 
stance Q' of (Qi_i,P) that is isomorphic to Qi. Otherwise, 
from Proposition |6] there exists p' and a direct rewriting Qi 
of Qi-i with p' such that there is an instance Q' of (Qi,P) 
that is isomorphic to Qi. 

We have thus completed the inductive argument and can
conclude that there is a PCQ $(\mathbb{Q}, \mathbb{P})$ with
$\mathbb{Q} \in \mathcal{Q}$ and a full instance Q" of
$(\mathbb{Q}, \mathbb{P})$ s.t. Q" is isomorphic to $Q' = Q_k$. □

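Lemma 4 Let Q be a CQ, $(\mathcal{F}, \mathcal{R})$ be a
linear+trans KB, and $(\Pi_{\mathbb{P}}, \mathcal{Q}_Q)$ be the
output of the algorithm. If $\mathcal{F}, \mathcal{R} \models Q$ then
$\mathcal{F}, \Pi_{\mathbb{P}} \models Q'$ for some
$Q' \in \mathcal{Q}_Q$.

Proof: Since $\mathcal{F}, \mathcal{R} \models Q$, there is a
(finite) classical rewriting Q' of Q with $\mathcal{R}$ such that
$\mathcal{F} \models Q'$. From Proposition 3 there is a PCQ
$(\mathbb{Q}, \mathbb{P})$ with $\mathbb{Q} \in \mathcal{Q}$ and a
full instance Q" of $(\mathbb{Q}, \mathbb{P})$ s.t. Q" is isomorphic
to Q'. Therefore, $\mathcal{F} \models Q"$. We conclude by
Proposition 3. □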

Lemma 5 Let Q be a CQ, (J^, TZ) be a linear+trans KB, and 
(np,(5Q) be the output of the algorithm. If P, lip ^ Qq then 
P,TZ^Q. 

Proof: Let $\mathbb{P}$ be the set of pattern definitions computed
in Step 3 of the algorithm, and $\Pi_{\mathbb{P}}$ the
corresponding set of Datalog rules. Consider the CQ $Q^{++}$
obtained from Q by replacing every atom $p(t_1, t_2)$ such that p is
transitive by the atom $p^+(t_1, t_2)$. The following claim
establishes the soundness of the internal rewriting mechanism in
Step 3:

Claim 1 IfP, Hp ^ Q^'^, then P,TZ\=Q. 

Proof of claim. Let $\mathbb{P}_0, \mathbb{P}_1, \ldots,
\mathbb{P}_k = \mathbb{P}$ be the sequence of sets of pattern
definitions that led to $\mathbb{P}$ in Step 3, with
$\mathbb{P}_{j+1}$ being obtained from $\mathbb{P}_j$ by a single
direct (internal) rewriting step. We prove by induction two distinct
properties expressed at rank $0 \le j \le k$:

P1 every rule in $\Pi_{\mathbb{P}_j}$ is a semantic consequence of
$\Pi_{\mathbb{P}_0} \cup \mathcal{R}$.

P2 for every fact base $\mathcal{F}'$ and CQ Q' (over the original
vocabulary): if $\mathcal{F}', \Pi_{\mathbb{P}_j} \models (Q')^{++}$,
then $\mathcal{F}', \mathcal{R} \models Q'$.

In the second property, $(Q')^{++}$ denotes the CQ obtained by
replacing every atom $p(t_1, t_2)$ such that p is transitive by the
atom $p^+(t_1, t_2)$. Observe that P2 at rank k yields the claim:
we simply take $\mathcal{F}' = \mathcal{F}$ and Q' = Q.

Base case (i = 0): property P1 is obviously verified.
For property P2, we note that $\Pi_{\mathbb{P}_0}$ consists of the
following rules for every transitive predicate p: the transitivity
rule $p^+(x,y) \wedge p^+(y,z) \to p^+(x,z)$ and the initialization
rule $p(x,y) \to p^+(x,y)$. Clearly, if
$\mathcal{F}', \Pi_{\mathbb{P}_0} \models (Q')^{++}$ then we have
$\mathcal{F}', \mathcal{R} \models Q'$, since if we can derive
$p^+(a, b)$ using $\Pi_{\mathbb{P}_0}$, then we can also derive
p(a, b) from $\mathcal{R}$ using the transitivity rule for p in
$\mathcal{R}$.

Induction step for PI: we assume property PI holds for 
some rank 0 < i < k and show that it holds also for i + 1. 

Suppose that P^+i is obtained from P^ by a single direct 
rewriting step w.r.t. pattern name P and the rule R = B ^ 
H € TZ'l- Let Q = P^[x,y\, Q be the considered instance 
of interest of Q w.r.t. R, p = {Q', H, Pu) be the considered 
internal unifier of Q with H, and a be the considered sub¬ 
stitution associated with p that preserves the external terms. 
Finally, let B' be obtained from cr(i?) by substituting the first 
(resp. second) external term by (resp. #2). 

Since we know that p is an internal unifier, the external 
terms of Q' cannot be unified together or with an existential 
variable. Thus by considering Q" and Pu> obtained from Q' 
and Pu by substituting the first (resp. second) external term by 
(resp. it is clear that/i' = (Q", iF, P^') ^ ™ifi®rof 

Q" with R such that a'{B) = B', where cr' is the substitution 
associated with p' that preserves the special terms ^1 and 
#2. 

We consider two cases depending on the nature of B'. 
Case 1: The first possibility is that B' is an atom (as opposed
to a repeatable pattern), in which case we add the following
rule to $\Pi_{\mathbb{P}_{i+1}}$: $B' \to p^+(\#1, \#2)$.

Let $a_1, \ldots, a_k$ be the atoms of Q", and let $a'_j$ be the
atom in P's definition from which $a_j$ is obtained. Since there is
a rewriting of $\{a_j \mid 1 \le j \le k\}$ with R into B' (using
the unifier $\mu'$), and the rule R appears in the original set of
rules $\mathcal{R}$, it follows that

$$\mathcal{R} \models B' \to a_1 \wedge \ldots \wedge a_k$$

From the induction hypothesis, we know that the rules
$a'_j \to p^+(\#1, \#2)$ ($1 \le j \le k$) are entailed by
$\Pi_{\mathbb{P}_0}, \mathcal{R}$. We also know that for all
$1 < j \le k$, the atoms $a_{j-1}$ and $a_j$ share a variable
corresponding respectively to #2 in $a'_{j-1}$ and to #1 in $a'_j$.
Thus, by applying the rules $a'_j \to p^+(\#1, \#2)$
($1 \le j \le k$) to the conjunction $a_1 \wedge \ldots \wedge a_k$,
we obtain $p^+(\#1, x_1) \wedge p^+(x_1, x_2) \wedge \ldots \wedge
p^+(x_{k-1}, \#2)$. Hence:

$$\Pi_{\mathbb{P}_0}, \mathcal{R} \models \bigwedge_{j=1}^{k} a_j \to
p^+(\#1, x_1) \wedge p^+(x_1, x_2) \wedge \ldots \wedge p^+(x_{k-1}, \#2)$$

Since $\Pi_{\mathbb{P}_0}$ contains a transitivity rule for $p^+$,
we can further infer that

$$\Pi_{\mathbb{P}_0} \models p^+(\#1, x_1) \wedge p^+(x_1, x_2) \wedge
\ldots \wedge p^+(x_{k-1}, \#2) \to p^+(\#1, \#2)$$

By chaining together the preceding entailments, we obtain
$\Pi_{\mathbb{P}_0}, \mathcal{R} \models (B' \to p^+(\#1, \#2))$,
as desired.

Case 2: The other possibility is that B' is a repeatable pattern 
of the form ^2] or S'+[^2, ^1]. Let / be a bijec- 

tion on {#1, #2}: if B' is of the form ^2], / is the 

identity, otherwise / permutes #1 and ^2. Then for all si 
in the definition of S, we add /(sr) to P’s definition, and we 
add the corresponding rules /(sr) —> p'^yb\,^ 2 ) to Hp^. 
Consider one such rule rule /(sr) —> #2). 

Let ai,... ,ak and af ... ,a'f. be defined as in Case 1. 
Since there is a rewriting of {aj | 0 < j < ft} with R € TZ^ 
into B', and since the rule R was obtained from a rule R' in 
TZ by replacing the transitive predicate s in the rule head by 
the repeatable pattern , it follows that 

^ 1= #2)) —> oi A ... A Ofc 

Arguing as in Case 1, we obtain 

np„,P h /(s(#l,#2)) ^p+(#l,#2) 

From the induction hypothesis, we know that that the rules 
si -A s+(#l, #2) are entailed from npg,P, and the same 
obviously holds for the rules/(sr) -5- /(s“''(#l, #2)). By 
combining the preceding entailments, we obtain Hp^,, TZ j= 
/(sr)^F+(#l,#2). 

Induction step for property P2: we assume P2 holds for 
some rank 0 < i < k and show that it holds also for i + 1 . 

Suppose now that P', IIp.^j \= (<3')++, for some fact base 
P' and CQ Q' (over the original predicates). This means that 
there is a finite derivation sequence T' = ,... ,Pm~^ 

such that P^~^ ^ and such that for all 0 < f < to, 

obtained from P^^ either (i) by a sequence of appli¬ 
cations of rules from lip. or (ii) by a sequence of applications 
of rules from Hp^^^ \ Hp^. 

In case (i), we have p/"'',np^ ^ P^+i++. Letting Pr 
be the fact base obtained by replacing every predicate in 
P++ by the corresponding predicate p, and recalling that lip. 
contains the rule p{x,y) p'^{x,y), we have P^,np^ \= 
P^^i. Applying the induction hypothesis (treating P^^i as a 
CQ), we obtain Pi,TZ\= Pe+i. 

In case (ii), we have P/'*’, (np._^^ \ HpJ \= P^f^^. From 
property PI, we obtain P}*"'', Hp^,, 7^ ^ Using the 

rule p{x,y) p'^{x,y) (that is present in Hp^,), the latter 
yields P^, IIpj,, TZ ^ -ft}!}*)- Finally, we note that if we can de¬ 
rive (a, b) from P^, Hp^, TZ, then we can also infer p{a, b) 
from P^, 7^ by using the transitivity rule for p instead of using 
p{x, y) -A p'^ix, y) and the transitivity rule forp+. Thus, we 
have Pi,TZ ^ Pi+i. 

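We have thus shown that for every $0 \le \ell < m$,
$\mathcal{F}_\ell, \mathcal{R} \models \mathcal{F}_{\ell+1}$. Since
$\mathcal{F}' = \mathcal{F}_0$, by chaining these implications
together, we obtain $\mathcal{F}', \mathcal{R} \models \mathcal{F}_m$.
Using the same reasoning as above, we can infer
$\mathcal{F}_m \models Q'$ from
$\mathcal{F}_m^{++} \models (Q')^{++}$. Then, by combining these
statements, we obtain $\mathcal{F}', \mathcal{R} \models Q'$. (end
proof of claim)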



Now let Q be the set of queries computed in Step 5 by 
performing all possible external direct rewritings w.r.t. P and 
rules from , starting from Q~^, and let Qq be the set 
of CQs associated with Q (defined as in Step 6). We start 
by proving the following claim, which relates external direct 
rewriting steps to sequences of classical direct rewritings. 

Claim 2 Let $\mathbb{Q}_{i+1}$ be a direct rewriting of
$\mathbb{Q}_i$ w.r.t. $\mathbb{P}$. Then every full instance of
$(\mathbb{Q}_{i+1}, \mathbb{P})$ is obtained from a sequence of
(classical) direct rewritings of some full instance of
$(\mathbb{Q}_i, \mathbb{P})$.

Proof of claim. Let (Qi+i,P) be obtained from an external 
rewriting of (, P) with rule R = B ^ H. This means 
that there is a minimally unifiable instance and a unifier 
/i = {X, H, Pu) of Q® with H (with associated substitution 
a) such that Qi+i — (j{Q^ \ ^) U cr{B). 

Let us consider a partial instance Qfj^i of (Qi_|_i,P) that 
fully instantiates (t(Q® \ X) but does not instantiate a{B) 
(we say that it is a (7(5)-excluding instance). Note that Qfj^i 
can be built equivalently by choosing a full instance Q® of 
(Q®, P), removing the atoms of X, then by applying the sub¬ 
stitution cr and adding (j{B). We can see that the classical 
direct rewriting of Q® according to p, produces More¬ 

over, since every full instance of (Q®, P) is a full instance of 
( Qi , P), we know that Q® is an instance of ( Qi , P) . 

Now consider any full instance Qi+i of (Qi+i,P). Note 
that it is a full instance of some tT(i?)-excluding instance 
(Qf+i.P)- There are two cases to consider; 

• If <j{B) is an atom, then Qi+i — Qfj^i and thus Qi+i is 
obtained from a classical direct rewriting of an instance 
of(Q„P). 

• Otherwise, if a{B) is a repeatable pattern, then Qi+i is 

obtained from expanding a{B) into a se¬ 

quence of k standard patterns, and expanding each of 
them into some atom ae. Let Bk = {ae | 1 < ^ < k}. 
Then, a{B) is generated in forward chaining from Bk 
with a sequence of applications of rules: k applications 
of transitivity rules, and k applications of the rules en¬ 
coded in P, each one stemming from a finite sequence 
of applications of rules of TZ (see Claim [T]l. Thus from 
the completeness of classical rewriting, Bk can be ob¬ 
tained from a sequence of classical direct rewritings 
from <t{B), and thus Qi+i is obtained from a sequence 
of classical direct rewritings of an instance of (Qi, P). 

(end proof of claim) 

The following claim shows the soundness of the external 
rewriting in Step 5 and completes the proof of the lemma. 

Claim 3 IfP, lip |= Qq, then P,TZ\= Q. 

Proof of claim. Suppose that lip \= Qq with Q G Q. 
We know that the PCQ Q is obtained from a finite sequence 
Qo = Qi, • ■ •, Qfc = Q of PCQs such that for all 0 < 
j < k, (Qj+i,P) is a direct external rewriting of (Qj,P). 
We will show by induction on j that -F, lip \= Qq implies 
(F,Tl\= Q for every 0 < j < k. 

The base case (j = 0) is a direct consequence of Claim [T] 
For the induction step, we assume the property is true at 
rank i, and we show that it is true at rank i -f 1. 


Suppose that -F, lip \= From Proposition [2 it 

follows that there is a full instance Qi+i of (Qi+i,P) such 
that F ^ Qi+i. By Claim |2] there is a full instance Qi 
of (Qi,P) such that Qi+i is obtained from a sequence of 
classical rewritings from Qi. Thus (from the correctness 
of the classical rewriting), there is a fact base F' such that 
F^TZ \= F' and F' \= Qi. Applying Proposition [2 we ob¬ 
tain lip \= Qq^. Now from our induction hypothesis, it 
follows that F',TZ |= Q, hence F,TZ \= Q. (end proof of 
claim) □ 

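Theorem 2 Let Q be a CQ, $(\mathcal{F}, \mathcal{R})$ be a
linear+trans KB, and $(\Pi_{\mathbb{P}}, \mathcal{Q}_Q)$ be the
output of the algorithm. Then: $\mathcal{F}, \mathcal{R} \models Q$
iff $\mathcal{F}, \Pi_{\mathbb{P}} \models Q'$ for some
$Q' \in \mathcal{Q}_Q$.

Proof: Follows from Lemma 4 and Lemma 5. □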

The following two lemmas show that the non-excluded 
minimally-unifiable instances are sufficient to ensure com¬ 
pleteness when the input query is atomic or when the input 
rule set satisfies the safety condition. 

Lemma 6 Let (Q, P) be a PCQ, R G R'j)), Q be an instance 
of interest of [Q, P) and fi — (Q', H, P„) be an external uni¬ 
fier of Q with R such that two external terms w.r.t. p, from 
a given pattern P^[ti,t 2 \ are unified together and with no 
existential variable. 

Every minimally-unifiable instance (Q, P) w.r.t. p that re¬ 
places P'^ [fi, ^ 2 ] os in the External Rewriting cases (i), (ii), 
or (Hi) will lead to a direct rewriting (Q(,P) that is more 
specific than (Q,P). Furthermore, for any classical direct 
rewriting Q'l of Q) with R, either (Q,P) > (Q",P) or there 
is a classical direct rewriting Q' of the minimally-unifiable 
instance of Q that replaces P~^[ti, ^ 2 ] os in case (iv) with R 
and{Q',F) > (Q",P). 

Proof: Without loss of generality, let us write
$\mathbb{Q} = q[t_1, t_2] \wedge P^+[t_1, t_2]$ where $q[t_1, t_2]$
denotes a set of atoms where $t_1$ and $t_2$ may occur. We denote by
$x_s$ and $x_e$ (s < e) the external terms of $P^+[t_1, t_2]$ w.r.t.
$\mu$, and by $A[x_s, x_e]$ the sequence of atoms expanded from
$P^+[t_1, t_2]$ involved in the unifier. Since we assume that no
existential variable is unified with variables $x_s$ and $x_e$, no
atom from q can be part of the unifier. Consider the following
minimally-unifiable instances:

1. $\mathbb{Q}_1 = q[t_1, t_2] \wedge P^+[t_1, x_s] \wedge A[x_s, x_e] \wedge P^+[x_e, t_2]$

2. $\mathbb{Q}_2 = q[t_1, t_2] \wedge A[t_1 = x_s, x_e] \wedge P^+[x_e, t_2]$

3. $\mathbb{Q}_3 = q[t_1, t_2] \wedge P^+[t_1, x_s] \wedge A[x_s, x_e = t_2]$

By unifying $x_s$ and $x_e$ together, we obtain the following
instances:

1. $q[t_1, t_2] \wedge P^+[t_1, x_s] \wedge A[x_s, x_s] \wedge P^+[x_s, t_2]$

2. $q[t_1, t_2] \wedge A[t_1, t_1] \wedge P^+[t_1, t_2]$

3. $q[t_1, t_2] \wedge P^+[t_1, t_2] \wedge A[t_2, t_2]$

Let $\mathbb{Q}'_i$ be the direct rewriting of $\mathbb{Q}_i$ w.r.t.
$\mu$ with R. It is easy to see that
$\mathbb{Q} \subseteq \mathbb{Q}'_2$ and
$\mathbb{Q} \subseteq \mathbb{Q}'_3$, thus
$(\mathbb{Q}'_2, \mathbb{P})$ and $(\mathbb{Q}'_3, \mathbb{P})$ are
more specific than $(\mathbb{Q}, \mathbb{P})$.

Let Ql be a full instance of (Qg,P). We construct a full 
instance Q of (Q,P) as follows. First note that ^[^ 1 ,^ 2 ] is 
common to both Q( and Q, so we will expand all patterns in 


q\ti,t 2 ] exactly as in Qi. Now let fci (resp. ^ 2 ) be the num¬ 
ber of children of [fi, Xs] (resp. , < 2 ]) in the instan¬ 

tiation of Qi, and expand P''"[fr, ^ 2 ] in Q into k = ki + k 2 
children; P[ti = xo,xi],... ,P[xk-i,Xk = ^ 2 ]- Expand 
each P[xi, Xi+i] with i < fci as is expanded the child of 
P'^\ti,Xs\ in Qi, and each P[xi,Xi+i] with ki < i < k as 
is expanded the {i — ki + 1)*^ child of P+[a;s, ^ 2 ] in Qi- By 
construction, there is an homomorphism from Q to Qi. We 
have thus shown that (Q, P) > (Q'l, P). 

Furthermore, let Q” be a classical direct rewriting of Q' 
with a rule R' = B' ^ H' w.r.t. uniher /r' = (Q', H', P^), 
where 1 < i < 3. If at least one atom involved in fj,' occurs 
in Q' \a{B) (where a is the substitution associated with /r), 
then, let = {Q", H\ P”} where Q" — Q' \ a{B) and 
P" is the restriction of P^ to terms occurring in Q" U H'. 
Since Q" ^ 0 and all terms from a{B) cannot connect two 
different terms from q[ti,t 2 ] (indeed, the only term shared 
between a{B) and q[ti,t 2 ] is either ti or ^ 2 ), o’(P) can be 
seen as a loop on ti (or ^ 2 ), therefore we can remove a{B) 
while preserving the unifier, i.e., /r" is a uniher of Q with 
R'. Moreover, since P" and Q" are only restrictions of P^ 
and Q' respectively, it holds that /r" > p'. Then, we denote 
by Q" the direct rewriting of Q with P' w.r.t. p" and obtain 
Q" > Q'i- The other possibility is that all atoms involved in 
/r' occur in <t{B), then, Q 2 (resp. Qg) is more specihc than Q 
since Q C Q'^ (resp. Q C Qg). Moreover, for any instance 
Qi of Q'l, one can easily build an instance Q' of Q in the 
same way as above, and see that Q' > Q'l. Thus, we have 

(Q,P)>(Q^P). □ 

Lemma 7 Let [Q, P) be a PCQ, R € Pj, Q be an instance 
of interest of { Q, P) and /i = {Q', H, P„) be an external uni¬ 
fier of Q with R such that one external term w.r.t. fifrom a 
given pattern P^\ti,t 2 \ is unified with an existential vari¬ 
able, and where all atoms in Q' are obtained from the expan¬ 
sion of a repeatable pattern. 

If Q is atomic, or if TZ^ is a set of safe linear rules, then 
every minimally-unifiable instance of {Q, P) w.r.f. fi that re¬ 
places all Pj^[fi,f 2 ] in the External Rewriting cases (ii) 
or (Hi) will lead to a direct rewriting (Q', P) that is more spe¬ 
cific than (Q,P). Furthermore, for any direct rewriting Q'l 
of Q'j^ with R, either Q > QI or there is a direct rewriting Q' 
of a minimally-unifiable instance of Q w.r.t. fi that replaces at 
least one repeatable pattern as in case (iv) and is such that 

Q' > QI- 

Proof: Let (Q, P), R, Q and p be as in the lemma statement, 
and let Pi^[fi, > Pk\P^^ ^ 2 ] be the repeatable patterns 

that are relevant for p,. For each 1 < i < fc, we denote by 
Pi[t\ = Xq,x\\, ..., Pi[x\_i,x\. = tf\ the sequenceof stan¬ 
dard patterns expanded from Pi^[ti,t\\, and we let x\. and 
x'e be the external terms of Pj~'’[fi, w.r.t. p. We 

assume without loss of generality that it is x],. that is unihed 
with an existential variable, and let Ai[x\., x\. = denote 
the atoms expanded from Pi[a:*, with Si < j < Ci. 

Since the unifier $\mu$ is single-piece, all repeatable patterns
relevant for $\mu$ have to share some variable. For simplicity, we
assume that they all share their second term, i.e.
$t_2^i = t_2^j$ for all $1 \le i, j \le k$. (The argument is
entirely similar, just more notationally involved, if this
assumption is not made.) Let us use $t_2$ for this shared term. Then
we can write $\mathbb{Q}$ as follows:

$$\mathbb{Q} = q[t_1^1, \ldots, t_1^k] \wedge \bigwedge_{1 \le i \le k} P_i^+[t_1^i, t_2]$$

Note that $t_2$ cannot occur in q.

Because we have chosen the second term to be shared
in all repeatable patterns, we only need to consider the
minimally-unifiable instance $\mathbb{Q}_M$ of
$(\mathbb{Q}, \mathbb{P})$ w.r.t. $\mu$ that replaces each
$P_i^+[t_1^i, t_2]$ by $P_i^+[t_1^i, x_s^i], A_i[x_s^i, x_e^i = t_2]$,
i.e. External Rewriting case (ii). Thus, we have

$$\mathbb{Q}_M = q[t_1^1, \ldots, t_1^k] \wedge \bigwedge_{1 \le i \le k}
\left( P_i^+[t_1^i, x_s^i] \wedge A_i[x_s^i, t_2] \right).$$
Let $\sigma$ be the substitution associated with $\mu$. From the
safety condition (see the section where it is defined), we know that
there is a pair of positions $\{p_1, p_2\}$ for the predicate p of
H, such that for all atoms $p(\vec{t})$ occurring in a pattern
definition the terms #1 and #2 occur in positions $\{p_1, p_2\}$. We
further note that the external terms in the concerned patterns are
$t_2$ (which unifies with an existential variable in H) and the
terms $x_s^i$ (which unify with a non-existential variable), and
each of these external terms must be obtained by instantiating term
#1 or #2. Since the $A_i[x_s^i, t_2]$ are unified together, and
share the same predicate p, it follows that all of the $x_s^i$ must
occur in the same position (either $p_1$ or $p_2$) of p; $t_2$ will
occur in the other position among $p_1$ and $p_2$. We therefore
obtain:

$$\sigma(x_s^1) = \sigma(x_s^2) = \ldots = \sigma(x_s^k) = x',$$

where x' is the term in B that unifies with all of the $x_s^i$.
(Note that if Q is an atomic query, there is a single $A_i$, so the
previous statement obviously holds, even without the safety
condition.) Thus, $\mathbb{Q}_M$ becomes:

$$q[t_1^1, \ldots, t_1^k] \wedge \bigwedge_{1 \le i \le k}
\left( P_i^+[t_1^i, x'] \wedge A_i[x', t_2] \right).$$

There is an isomorphism from $\mathbb{Q}$ to
$\mathbb{Q}_M \setminus \{A_i \mid 1 \le i \le k\}$ that maps $t_2$
to x'. We then observe that $\{A_i \mid 1 \le i \le k\}$ is exactly
the set of atoms that will be erased in the direct rewriting
$\mathbb{Q}'_M = \mathbb{Q}_M \setminus \{A_i \mid 1 \le i \le k\}
\cup \sigma(B)$, where $\sigma$ is a substitution associated with
$\mu$. Therefore, $\mathbb{Q}$ is isomorphic to
$\mathbb{Q}'_M \setminus \sigma(B)$, hence
$(\mathbb{Q}, \mathbb{P}) \ge (\mathbb{Q}'_M, \mathbb{P})$. One can
see that the same reasoning as in the previous proof can be applied
here to show that any further direct rewriting $\mathbb{Q}''_M$ of
$\mathbb{Q}'_M$ will lead to more specific queries. □

Theorem 3 The modified query rewriting algorithm halts.
Moreover, Theorem 2 (soundness and completeness) holds for
the modified algorithm if either the input CQ is atomic, or the
input rule set is safe.

Proof: From LemmaH] we know that if we do not exclude any 
rewriting the algorithm is sound and complete, and Lemmas 6
and 7 show that for any rewriting Q that we exclude, there
is another rewriting Q' obtainable using only non-excluded
direct rewritings that is more general than Q. Therefore, the
modified algorithm (in case of an atomic CQ, or a safe rule
set) is complete. Furthermore, excluding rewritings cannot
compromise the soundness of the rewriting mechanism. □


Theorem 4 Both (i) atomic CQ entailment over linear+trans
KBs and (ii) CQ entailment over safe linear+trans KBs are
NL-complete in data complexity.

Proof: Consider a CQ Q, a linear+trans rule set $\mathcal{R}$, and a
fact base $\mathcal{F}$. Suppose that either Q is atomic or
$\mathcal{R}$ satisfies the safety condition. Using Theorem 3, we
can compute a finite set $\Pi_{\mathbb{P}}$ of Datalog rules and a
finite set $\mathcal{Q}_Q$ of CQs with the property that
$\mathcal{F}, \mathcal{R} \models Q$ iff
$\mathcal{F}, \Pi_{\mathbb{P}} \models Q'$ for some
$Q' \in \mathcal{Q}_Q$. As $\Pi_{\mathbb{P}}$ and $\mathcal{Q}_Q$ do
not depend on the fact base $\mathcal{F}$, they can be computed and
stored using constant space w.r.t. $|\mathcal{F}|$.

To test whether $\mathcal{F}, \Pi_{\mathbb{P}} \models Q'$ for some
$Q' \in \mathcal{Q}_Q$, we proceed as follows. For each rewriting
$Q' \in \mathcal{Q}_Q$, we can consider every possible mapping $\pi$
from the variables of Q' to the terms of $\mathcal{F}$. We then
check whether the facts in $\pi(Q')$ are entailed from
$\mathcal{F}, \Pi_{\mathbb{P}}$. For every atom $\alpha \in Q'$ over
one of the original predicates, we can directly check if
$\pi(\alpha) \in \mathcal{F}$, since the rules in $\Pi_{\mathbb{P}}$
can only be used to derive facts over the new predicates $p^+$. For
every atom $p^+(t_1, t_2) \in Q'$ where $p^+$ is a new predicate, we
need to check whether
$\mathcal{F}, \Pi_{\mathbb{P}} \models p^+(\pi(t_1), \pi(t_2))$.
Because of the shape of the rules in $\Pi_{\mathbb{P}}$, the latter
holds just in the case that there is a path of constants
$c_1, \ldots, c_n$ with $c_1 = \pi(t_1)$ and $c_n = \pi(t_2)$ such
that for every $1 \le i < n$, there is a rule
$\rho_i = B_i \to p^+(\#1, \#2)$ and a substitution $\sigma_i$ of
the variables in $B_i$ by constants in $\mathcal{F}$ such that
$\sigma_i(\#1) = c_i$, $\sigma_i(\#2) = c_{i+1}$, and
$\sigma_i(B_i) \in \mathcal{F}$. To check for the existence of such
a path, we guess the constants $c_i$ in the path one at a time,
together with the witnessing rule $\rho_i$ and substitution
$\sigma_i$, using a counter to ensure that the number of guessed
constants does not exceed the number of constants in $\mathcal{F}$.
Note that we need only logarithmically many bits for the counter, so
the entire procedure runs in non-deterministic logarithmic space.
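
The path condition above can be illustrated by a small Python sketch. It is not the non-deterministic logspace procedure from the proof: it swaps in a deterministic breadth-first search, which uses linear space, purely to make the reachability condition concrete. The encoding is an assumption of this sketch: a rule B → p+(#1, #2) for one fixed p+ is given as a pair (predicate, argument tuple) for its atomic body, every body is assumed to mention both "#1" and "#2" and no constants, and facts are (predicate, argument tuple) pairs.

```python
from collections import deque


def match(body_args, fact_args):
    """Return a substitution from body terms to constants, or None."""
    if len(body_args) != len(fact_args):
        return None
    sub = {}
    for term, const in zip(body_args, fact_args):
        # A repeated body term must map to the same constant everywhere.
        if sub.setdefault(term, const) != const:
            return None
    return sub


def edges(rules, facts):
    """Collect pairs (c, c') such that some rule body, instantiated by a
    fact, maps #1 to c and #2 to c'."""
    graph = {}
    for pred, body_args in rules:
        for fact_pred, fact_args in facts:
            if fact_pred != pred:
                continue
            sub = match(body_args, fact_args)
            if sub is not None and "#1" in sub and "#2" in sub:
                graph.setdefault(sub["#1"], set()).add(sub["#2"])
    return graph


def entails_plus(rules, facts, start, goal):
    """Check whether a path of at least one step leads from start to goal."""
    graph = edges(rules, facts)
    seen = set()
    queue = deque(graph.get(start, ()))
    while queue:
        c = queue.popleft()
        if c == goal:
            return True
        if c not in seen:
            seen.add(c)
            queue.extend(graph.get(c, ()))
    return False
```

The search starts from the successors of `start` rather than from `start` itself, because deriving p+(a, b) requires at least one rule application.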

Hardness for NL can be shown by an easy reduction from 
the NL-complete directed reachability problem. □ 

Theorem 5 Both (i) atomic CQ entailment over linear+trans
KBs and (ii) CQ entailment over safe linear+trans KBs are in
ExpTime in combined complexity. Furthermore, atomic CQ
entailment over linear+trans KBs is ExpTime-hard in combined
complexity.

The proof of Theorem 5 is provided in the following two
lemmas.

Lemma 8 Both (i) atomic CQ entailment over linear+trans 
KBs and (ii) CQ entailment over safe linear+trans KBs are 
in ExpTime in combined complexity. 

Proof: Consider a CQ Q, a linear+trans rule set
$\mathcal{R} = \mathcal{R}_L \cup \mathcal{R}_T$, with
$\mathcal{R}_L$ a set of linear rules and $\mathcal{R}_T$ a set of
transitivity rules, and a set of facts $\mathcal{F}$. Suppose that
either condition (i) or (ii) of the lemma statement holds. It
follows from Theorem 3 that the modified query rewriting algorithm
halts and returns a finite set $\Pi_{\mathbb{P}}$ of Datalog rules
and a finite set $\mathcal{Q}_Q$ of CQs such that
$(\mathcal{F}, \mathcal{R}) \models Q$ iff
$(\mathcal{F}, \Pi_{\mathbb{P}}) \models Q'$ for some
$Q' \in \mathcal{Q}_Q$.

To prove membership in ExpTime, we show that: 

(i) $\Pi_{\mathbb{P}}$ is of exponential size and can be built in
exponential time;

(ii) $\mathcal{Q}_Q$ is a set of exponential size, that can be built
in exponential time, and any $Q' \in \mathcal{Q}_Q$ is of linear
size in Q;

(iii) we can saturate $\mathcal{F}$ with $\Pi_{\mathbb{P}}$ into
$\mathcal{F}^*$ in polynomial time in the size of $\Pi_{\mathbb{P}}$
and $\mathcal{F}$, and the resulting set of facts is of polynomial
size in $\mathcal{F}$;

(iv) $\mathcal{Q}_Q$ can be evaluated over $\mathcal{F}^*$ in
exponential time.

We denote by r the maximum arity of a predicate in TZ, 
by p the number of predicates occurring in TZ and by t the 
number of transitive predicates. 

Let us consider the construction of $\Pi_{\mathbb{P}}$. Since all
rules generated in this step are linear rules and, given a predicate
s, the number of non-isomorphic atoms using s is bounded by an
exponential in r, for each transitive predicate there can be only
exponentially many generated rules. Thus
$|\Pi_{\mathbb{P}}| = O(t \times p \times r^r)$. For the first
point, it remains to show that $\Pi_{\mathbb{P}}$ can be built in
exponential time. Consider the following algorithm: for each pattern
definition P, repeat until fixpoint: choose a rule
$R = (B, H) \in \mathcal{R}_L$, compute all instances of interest of
P w.r.t. R, and if there is an internal unifier, add the
corresponding rewriting to P's definition. The repeatable pattern
$P^+[t_1, t_2]$ can be expanded into at most r + 2 standard patterns
(by the definition of instances of interest), and thus there are
r + 2 possible sizes for the instances of interest. Then for each of
these standard patterns, we can choose an atom from P's definition
that uses the predicate of H. Since there are at most $r^r$ possible
choices for instantiating a standard pattern, and there are at most
r + 2 standard patterns to expand, we obtain the following bound:
there are $O((r + 2) \times (r^r)^{r+2})$ different instances of
interest for a given pattern definition and a given rule. Therefore
each step of the algorithm can be processed in exponential time.
Since there are only exponentially many different possible
rewritings, the fixpoint is reached in at most exponential time.
Hence, the construction in Point (i) runs in exponential time.

The argument for Point (ii) proceeds similarly. The only
difference comes from the fact that since Q might not be
atomic, we apply the rewriting step to conjunctive queries.
However, from Proposition 7 we know that all rewritten
queries have size bounded by the size of Q. Therefore, by
using the same argument as for Point (i), we know that this
step is exponential in both the maximum arity and in the size
of the initial query Q.

Regarding Point (iii), a single breadth-first step with all
non-transitive rules in $\Pi_{\mathbb{P}}$ followed by the
computation of the transitive closure is enough to build
$\mathcal{F}^*$. While there are exponentially many non-transitive
rules, each can be applied in polynomial time (since the body of
each rule is atomic). Since each rule only creates atoms with
transitive predicates, the resulting set of facts is of size
$|terms(\mathcal{F})|^2 \times p$. Now the transitive closure adds
at most a quadratic number of atoms (for each transitive predicate),
and can be computed in polynomial time in the size of $\mathcal{F}$.
Therefore, $\mathcal{F}^*$ can be built in exponential time in r and
is of polynomial size in $|\mathcal{F}|$.
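
The two-phase saturation described for Point (iii) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: each non-transitive rule is encoded as a hypothetical triple (body predicate, body argument tuple, head predicate) standing for B → p+(#1, #2), every body is assumed to mention both "#1" and "#2", and the transitive closure is computed by a naive fixpoint loop rather than an optimized algorithm.

```python
def apply_linear_rules(facts, rules):
    """One breadth-first step: apply every atomic-body rule to every fact."""
    derived = set()
    for body_pred, body_args, head_pred in rules:
        for pred, args in facts:
            if pred != body_pred or len(args) != len(body_args):
                continue
            sub = {}
            # Repeated body terms must be matched by equal constants.
            if all(sub.setdefault(t, c) == c for t, c in zip(body_args, args)):
                derived.add((head_pred, (sub["#1"], sub["#2"])))
    return derived


def transitive_closure(pairs):
    """Naive fixpoint computation of the transitive closure of a relation."""
    closure = set(pairs)
    changed = True
    while changed:
        changed = False
        for a, b in list(closure):
            for c, d in list(closure):
                if b == c and (a, d) not in closure:
                    closure.add((a, d))
                    changed = True
    return closure


def saturate(facts, rules):
    """Return the original facts plus all derivable p+ facts."""
    by_pred = {}
    for pred, pair in apply_linear_rules(facts, rules):
        by_pred.setdefault(pred, set()).add(pair)
    result = set(facts)
    for pred, pairs in by_pred.items():
        result |= {(pred, pair) for pair in transitive_closure(pairs)}
    return result
```

For each new predicate the closure contains at most quadratically many pairs over the terms of the fact base, which matches the size bound stated above.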

It remains to show that Point (iv) can be done in exponential
time. Observe that since each query $Q' \in \mathcal{Q}_Q$ is of
size bounded by the initial query Q (Proposition 7), its evaluation
can be computed in NP, thus in exponential time. Since
there are only exponentially many queries in $\mathcal{Q}_Q$, this
step is also done in exponential time.

Therefore, we can conclude that the entailment problem
over linear+trans sets of rules with an atomic query, and over
safe linear+trans sets of rules, is in ExpTime. □

Lemma 9 Atomic CQ entailment over linear+trans KBs is 
ExpTime-hard in combined complexity. 

Proof: To prove hardness, we can rely on a proof from
[Bienvenu and Thomazo, 2016]. In that paper, the authors prove that
Regular Path Query (RPQ) entailment over linear knowledge
bases is ExpTime-hard. Neither problem is a subproblem of the
other. However, the proof uses only a particular RPQ of the form
$p^+(t_1, t_2)$. This RPQ is entailed from
$(\mathcal{F}, \mathcal{R}_L)$ if and only if the atomic CQ
$p(t_1, t_2)$ is entailed from
$(\mathcal{F}, \mathcal{R}_L \cup \{trans(p)\})$. Nevertheless, we
recall below the main lines of the proof, while reformulating it in
terms of our problem. Note that the linear rules have a non-atomic
head to simplify the explanations, but can be decomposed into
atomic-headed rules without loss of generality.

The reduction is from the simulation of any Alternating
Turing Machine (ATM) that runs in polynomial space.
More specifically, the problem they consider is the following
ExpTime-complete problem: given a PSpace ATM M,
and a word x, does M accept x? Without loss of generality,
they consider ATMs where each non-final universal state has
exactly two existential state successors, and each non-final
existential state has exactly two universal state successors.

The proof uses a single transitive predicate that we call p.
Given an ATM M with input x, we create a predicate of arity
polynomial in x and M, that encodes the current configuration
of the machine (its tape and the current state and head
position). Furthermore, each atom encoding a configuration
also uses a term as a “begin” and another as an “end”
(respectively the first and last position of the predicate); these
are used later by the transitivity rules. Linear rules are used
to generate the transitions of the ATM. First, for each transition
in the ATM, there is a linear rule that generates the two
next configurations, and depending on the type of the current
state different transitive atoms are generated as illustrated by
Figure 2.

The initial configuration contains two special constants b 
and e as begin and end, and the set of facts contains only the 
atom encoding this configuration. 

When the state of the current configuration s is existential, 
four atoms using predicate p are generated in the next step, the 
first two being used to link the begin of s to the begin of the 
two next configurations (since the ATM is non-deterministic 
by nature), and the last two atoms being used to link the end 
of the two next configurations to the end of s. 

When the state of the current configuration s is universal, 
three atoms using p are generated, the first one links the begin 
of s to the begin of the first next configuration, the second one 
links the end of the first next configuration to the begin of the 
second next configuration, and finally the last one links the 
end of the last next configuration to the end of s. 

Finally, when the state of the current configuration s is ac¬ 
cepting, an atom using p linking the begin of s with the end 
of s is generated. 

The idea is that linear rules simulate the run of the machine, 
and that transitivity rules connect the initial begin to the initial 
end if and only if M accepts x. 
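
The three cases above can be summarized by a short Python sketch that lists the p-atoms generated for one configuration. The function name `p_atoms`, the string labels for state kinds, and the encoding of a configuration by its (begin, end) pair of terms are assumptions made for this illustration only; the actual reduction produces these atoms through linear rules over configuration atoms.

```python
def p_atoms(kind, current, successors=()):
    """Return the p-atoms linking a configuration to its successors.

    current is a (begin, end) pair; successors holds the (begin, end)
    pairs of the two next configurations (unused for accepting states).
    """
    b, e = current
    if kind == "accepting":
        # Link the begin of s directly to the end of s.
        return [("p", b, e)]
    (b1, e1), (b2, e2) = successors
    if kind == "existential":
        # Either successor may bridge begin to end.
        return [("p", b, b1), ("p", b, b2), ("p", e1, e), ("p", e2, e)]
    if kind == "universal":
        # Both successors must be traversed in sequence.
        return [("p", b, b1), ("p", e1, b2), ("p", e2, e)]
    raise ValueError(f"unknown state kind: {kind}")
```

In the existential case a single successor whose begin reaches its end suffices to connect b to e by transitivity, whereas in the universal case the chain passes through both successors.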



Figure 2: Reduction from ATM simulation to atomic CQ
entailment over linear+trans knowledge bases. Edges stand for
p-atoms and arrays stand for configuration atoms, with the
first and last elements corresponding to the begin and end
terms.

Then, the query just asks whether the begin of the initial 
configuration can be linked to the end of the initial configura¬ 
tion (i.e., Q = p{b, e)). 

This reduction shows that atomic CQ entailment over
linear+trans sets of rules is ExpTime-hard. □


















